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Abstract. We prove a boundary Harnack inequality for jump-type Markov processes 
on metric measure state spaces, under comparability estimates of the jump kernel and 
Urysohn-type property of the domain of the generator of the process. The result holds for 
positive harmonic functions in arbitrary open sets. It applies, e.g., to many subordinate 
Brownian motions, Levy processes with and without continuous part, stable-like and 
censored stable processes, jump processes on fractals, and rather general Schrodinger, 
drift and jump perturbations of such processes. 



I. Introduction 

The boundary Harnack inequality (BHI) is a statement about nonnegative functions 
which are harmonic on an open set and vanish outside the set near a part of its boundary. 
BHI asserts that the functions have a common boundary decay rate. The property 
requires proper assumptions on the set and the underlying Markov process, ones which 
secure relatively good communication from near the boundary to the center of the set. 
By this we mean that the process starting near the boundary visits the center of the set 
at least as likely as creep far along the boundary before leaving the set. 

BHI for harmonic functions of the Laplacian A in Lipschitz domains was proved in 
1977-78 by B. Dahlberg, A. Ancona and J.-M. Wu ([4, 37, 80]), after a pioneering at- 
tempt of J. Kemper ([55, 56]). In 1989 R. Bass and K. Burdzy proposed an alternative 
probabilistic proof based on elementary properties of the Brownian motion ([13]). The 
resulting 'box method' was then applied to more general domains, including Holder do- 
mains of order r > 1/2, and to more general second order elliptic operators ([14, 15]). BHI 
trivially fails for disconnected sets, and counterexamples for Holder domains with r<l/2 
are given in [15]. In 2001-09, H. Aikawa studied BHI for classical harmonic functions in 
connection to the Carleson estimate and under exterior capacity conditions ([1, 2, 3]). 

Moving on to nonlocal operators and jump-type Markov processes, in 1997 K. Bogdan 
proved BHI for the fractional Laplacian A"/ 2 (and the isotropic a-stable Levy process) for 
< a < 2 and Lipschitz sets ([19]). In 1999 R. Song and J.-M. Wu extended the results 
to the so-called fat sets ([73]), and in 2007 K. Bogdan, T. Kulczycki and M. Kwasnicki 
proved BHI for A a / 2 in arbitrary, in particular disconnected, open sets ([26]). In 2008 
P. Kim, R. Song and Z. Vondracek proved BHI for subordinate Brownian motions in 
fat sets ([61]) and in 2011 extended it to a general class of isotropic Levy processes and 
arbitrary domains ([63]). Quite recently, BHI for A + A a//2 was established by Z.-Q. Chen, 
P. Kim, R. Song and Z. Vondracek [32]. We also like to mention BHI for censored [43] 
by Q. Guan and fractal jump processes [53, 75] by K. Kaleta and M. Kwasnicki. 
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Generally speaking, BHI is more a topological issue for diffusion processes, and more 
a measure-theoretic issue for jump-type Markov processes, which may transport from 
near the boundary to the center of the set by direct jumps. However, direct jumps fully 
explain the asymptotics of harmonic functions only at points where the set is rather thin, 
and for other points a more complex interplay occurs between large jumps and gradual 
'excursions' away from the boundary, see [19, 26] (the corresponding classification of the 
boundary points seems to be of considerable importance). 

We recall that BHI in particular applies to and may yield an approximate factorization 
of the Green function. This line of research was completed for Lipschitz domains in 
2000 by K. Bogdan ([20]) for A and in 2002 by T. Jakubowski ([50]) for A°/ 2 . It is 
now a well-established technique ([46]) and extensions were proved, e.g., for subordinate 
Brownian motions by P. Kim, R. Song and Z. Vondracek ([64]). We should note that so 
far the technique is typically restricted to Lipschitz or fat sets. Furthermore, for smooth 
sets, e.g. C 1 ' 1 sets, the approximate factorization is usually more explicit. This is so 
because for smooth sets the decay rate in BHI can often be explicitly expressed in terms 
of the distance to the boundary of the set. The first complete results in this direction 
were given for A in 1986 by Z. Zhao ([81]) and for A a / 2 in 1997 by T. Kulczycki ([65]) 
and in 1998 by Z.-Q. Chen and R. Song ([35]). The estimates are now extended to 
subordinate Brownian motions, and the renewal function of the subordinator is used in 
the corresponding formulations ([64]). Accordingly, the Green function of smooth sets 
enjoys approximate factorization for rather general isotropic Levy processes ([29, 64]). 
We expect further progress in this direction with applications to perturbation theory via 
the so-called 3G theorems, and to nonlinear partial differential equations ([25, 46, 68]). 
We should also mention estimates and approximate factorization of the Dirichlet heat 
kernels, which are intensively studied at present. The estimates depend on BHI ([24]), 
and reflect the fundamental decay rate in BHI ([31, 45]). 

BHI tends to self-improve and may lead to the existence of the boundary limits of 
ratios of nonnegative harmonic functions, thanks to oscillation reduction ([13, 19, 26, 
52]). The oscillation reduction technique is rather straightforward for local operators. 
It is more challenging for non-local operators, as it involves subtraction of harmonic 
functions, which destroys global nonnegativity. The technique requires a certain scale 
invariance, or uniformity of BHI, and works, e.g., for A in Lipschitz domains ([13]) and for 
A Q//2 in arbitrary domains ([26]). We should remark that Holder continuity of harmonic 
functions is a similar phenomenon, related to the usual Harnack inequality, and that BHI 
extends the usual Harnack inequality if, e.g., constant functions are harmonic. Holder 
continuity of harmonic functions is crucial in the theory of partial differential equations 
[6, 16], and the existence of limits of ratios of nonnegative harmonic functions leads to 
the construction of the Martin kernel and to representation of nonnegative harmonic 
functions ([5, 26]). 

The above summary indicate further directions of research resulting from our develop- 
ment. The main goal of this article is to study the following boundary Harnack inequality. 
In Section 2 we specify notation and assumptions which validate the estimate. 

(BHI) Let x G X, < r < R < R , and let D C B(x , R) be open. Suppose that 
nonnegative functions /, g on X are regular harmonic in D with respect to the 
process X t , and vanish in B(xq, R) \ D. There is C(i.i) = C(i.i)(xq, r, R) such that 



f(x)g(y) < c(i.i) f(y)g(x) , 



x,y e B(x ,r). 



(1.1) 
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Here X t is a Hunt process, having a metric measure space X as the state space, and 
Ro G (0, oo] is a localization radius (discussed in Section 2). Also, a nonnegative function 
/ is said to be regular harmonic in D with respect to X t if 

f(x) = EJ(X TD ), xeD, (1.2) 

where To is the time of the first exit of X t from D. To facilitate cross-referencing, in (1.1) 
and later on we let cm denote the constant in the displayed formula (i). By c or q we 
denote secondary (temporary) constants in a lemma or a section, and c = c(a, . . . , z), or 
simply c(a, . . . , z), means a constant c that may be so chosen to depend only on a, . . . , z. 
Throughout the article, all constants are positive. 

The present work started with an attempt to obtain bounded kernels which reproduce 
harmonic functions. We were motivated by the so-called regularization of the Poisson 
kernel for A Q//2 ([22], [26, Lemma 6]), which is crucial for the Carleson estimate and BHI 
for A Q//2 . In the present paper we construct kernels obtained by gradually stopping the 
Markov process with a specific multiplicative functional before the process approaches the 
boundary. The construction is the main technical ingredient of our work, and is presented 
in Section 4. The argument is intrinsically probabilistic and relies on delicate analysis on 
the path space. At the beginning of Section 4 the reader will also find a short informal 
presentation of the construction. Section 2 gives assumptions and auxiliary results. The 
boundary Harnack inequality (Theorem 3.5), and the so-called local supremum estimate 
(Theorem 3.4) are presented in Section 3, but the proof of Theorem 3.4 is deferred to 
Section 4. In Section 5 we verify in various settings the scale-invariance of BHI, discuss 
the relevance of our main assumptions from Section 2, and present many applications, 
including subordinate Brownian motions, Levy processes with or without continuous 
part, stable-like and censored processes, Schrodinger, gradient and jump perturbations, 
processes on fractals and more. 

2. Assumptions and Preliminaries 

Let (3£, d, m) be a metric measure space such that all bounded closed sets are compact 
and m has full support. Let B(x,r) = {y G X : d(x,y) < r}, where x G X and r > 0. 
All sets, functions and measures considered in this paper are Borel. Let Ro G (0, oo] (the 
localization radius) be such that X\B(x, 2r) ^ for all x G X and all r < R . Let XU{d} 
be the one-point compactification of X (if X is compact, then we add d as an isolated 
point). Without much mention we extend functions / on X to XU{d} by letting f(d) = 0. 
In particular, we write / G Cq{X) if / is a continuous real- valued function on X U {d} 
and f[d) = 0. If furthermore / has compact support in X, then we write / G C C (X). For 
a kernel kix,dy) on X ([38]) we let kf(x) = f f(y)k(x,dy), provided the integral makes 
sense, i.e., / is (measurable and) either nonnegative or absolutely integrable. Similarly, 
for a kernel density function k(x,y) > 0, we let kix,E) = f E k{x,y)m(dy) and k{E,y) = 
J E k(x, y)m(dx) for BC1 

Let (X t X,-M t ,P x ) be a Hunt process with state space X (see, e.g., [18, 1.9] or [39, 
3.23]). Here X t are the random variables, Ai t is the usual right- continuous filtration, 
P x is the distribution of the process starting from x G X, and E x is the corresponding 
expectation. The random variable ( G (0, oo] is the lifetime of X t , so that X t = d for 
t > £■ This should be kept in mind when interpreting (1.2) above, (2.1) below, etc. The 
transition operators of X t are defined by 



TJ(x) = E x f(X t ), 



t > 0, x G X, 



(2.1) 
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whenever the expectation makes sense. We assume that the semigroup T t is Feller and 
strong Feller, i.e., for t > 0, T t maps bounded functions into continuous ones and C (X) 
into C (X). The Feller generator A of X t is defined on the set T>(A) of all those / G C (X) 
for which the limit 

Af(x)=\im TJ{x) - f{x) 



t\o t 

exists uniformly in x 6 I. The a-potential operator, 

/»oo /*0O 

U a f{x) = E x / f(X t )e- at dt = / e~ at T t f(x)dt, a > 0, x G X, 

Jo Jo 

is defined whenever the expectation makes sense. We let U = U®, the potential operator. 
The kernels of T t , lA a and U. are denoted by T t (x, dy), U a {x, dy) and U (x, dy), respectively. 

Recall that a function / > is called a-excessive (with respect to T t ) if for all x £ X, 
e- at T t f(x) < f(x) for t > 0, and e~ at T t f(x) -> f(x) as t -> 0+. When a = 0, we simply 
say that / is excessive. 

We enforce a number of conditions, namely Assumptions A, B, C and D below. We 
start with a duality assumption, which builds on our discussion of X t . 

Assumption A. There are Hunt processes X t and X t which are dual with respect to 
the measure m (see [18, VI. 1] or [36, 13.1]). The transition semigroups of X t and X t are 
both Feller and strong Feller. Every semi-polar set of X t is polar. 

In what follows, objects pertaining to X t are distinguished in notation from those for 
X t by adding a hat over the corresponding symbol. For example, T t and lA a denote the 
transition and a-potential operators of X t . The first sentence of Assumption A means 
that for all a > 0, there are functions U a (x,y) = I4 a (y,x) such that 



UJ{x) = J U a (x,y)f(y)m(dy), U a f(x) = j U a (x,y)f(y)m(dy) 

for all / > and x G X, and such that x i— y U a (x, y) is a-excessive with respect to T t , and 
y i — y U a (x,y) is a-excessive with respect to T t (that is, a-co-excessive) . The a-potential 
kernel U a (x, y) is unique (see [36, Theorem 13.2] or remarks after [18, Proposition VI. 1.3]). 

The condition in Assumption A that semi-polar sets are polar is also known as Hunt's 
hypothesis (H). Most notably, it implies that the process X t never hits irregular points, 
see, e.g., [18, 1. 11 and II. 3] or [36, Chapter 3]. The a-potential kernel is non-increasing in 
a > 0, and hence the potential kernel U(x, y) = lim t _> + U a (x, y) G [0, oo] is well-defined. 

We consider an open set D C X and the time of the first exit from D for X t and X t , 

r D = mf{t >0:X t (£D} and f D = inf{t >0:X t <£ D}. 

We define the processes killed at td, 

X t D =lf' ift<TD ' and X? = 

[d, if t>r D , 

We let Tf(x,dy) and Tf(x,dy) be their transition kernels. By [36, Remark 13.26], Xf 
and Xf are dual processes with state space D. Indeed, for each x G D, P^-a.s. the 
process X t only hits regular points of X \ D when it exits D. In the nomenclature of [36, 
13.6], this means that the left-entrance time and the hitting time of X \ D are equal 
P x -a.s. for every x G D. In particular, the potential kernel Gu(x,y) of X® exists and is 
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unique, although in general it may be infinite ([18, pp. 256-257]). Go(x,y) is called the 
Green function for X t on D, and it defines the Green operator Go, 

G D f(x) = [ f(y)G D (x, y)m(dy) = E x [ ° f(X t )dt, xeX,f>0. 
Jx Jo 

Note that U(x,y) = Gx(x,y). When X t is symmetric (self-dual) with respect to m, then 
Assumption A is equivalent to the existence of the a-potential kernel U a (x,y) for X t , 
since then Hunt's hypothesis (H) is automatically satisfied, see [36]. 

The following regularity hypothesis plays a crucial role in our paper, providing enough 
'smooth' functions on X to approximate indicator functions of compact sets. 

Assumption B. There is a linear subspace D of T>(A) D T>{A) satisfying the following 
condition. If K is compact, D is open, and K C D C X, then there is / G T> such that 
f{x) — 1 for x G K, f(x) = for x G X \ D, < f(x) < 1 for x G X, and the boundary 
of the set {x : f(x) > 0} has measure m zero. We let 

g{K,D) =wfsupmax(Af(x),Af(x)), (2.2) 

/ xex 

where the infimum is taken over all such functions /. 

Thus, nonnegative functions in T>(A) H T>(A) separate the compact set K from the 
closed set X\D: there is a Urysohn (bump) function for K and X \ D in the domains. 

Note that constant functions are not in T>(A) nor T>(A) unless X is compact. In the 
Euclidean case X = R d , T> can often be taken as the class C^°(R d ) of compactly supported 
smooth functions. The existence of T> is problematic if X is more general. However, for 
the Sierpihski triangle and some other self-similar (p.c.f.) fractals, T> can be constructed 
by using the concept of splines on fractals ([53, 76]). Also, a class of smooth indicator 
functions was recently constructed in [69] for heat kernels satisfying upper sub-Gaussian 
estimates on X. Further discussion is given in Section 5 and Appendix A. Here we note 
that Assumption B implies that the jumps of X t are subject to the following identity, 
which we call the Levy system formula for X t , 

E x V f{s,X 8 -,X a ) =E X [ [ f(s,X s .,z)u(X s _,dz)ds. (2.3) 
se[o,t] Jo Jx 

Here / : [0, oo) x X x X — > [0, oo], f(x,x) = for all x G X, and v is a kernel on X 
(satisfying u(x, {x}) = for all x G X), called the Levy kernel of X t , see [17, 72, 78]. For 
more general Markov processes, ds in (2.3) is superseded by the differential of a perfect, 
continuous additive functional, and (2.3) defines u(x, •) only up to a set of zero potential, 
that is, for m-almost every x G X. By inspecting the construction in [17, 72], and using 
Assumption B, one proves in a similar way as in [12, Section 5] that the Levy kernel v 
satisfies 

uf{x) = Urn , / G C C (X), xeX\ supp /. (2.4) 

This formula, as opposed to (2.3), defines v(x, dy) for all x G X. With only one exception, 
to be discussed momentarily, we use (2.4) and not (2.3), hence we take (2.4) as the 
definition of v. It is easy to see that (2.4) indeed defines u(x,dy): if / G T>(A) and 
x G X \ supp/, then vf{x) = Af(x). By Assumption B, the mapping / \-t vf(x) is a 
densely defined, nonnegative linear functional on C C {X \ {x}), hence it corresponds to a 
nonnegative Radon measure u(x,dy) on X \ {x}. As usual, we let is(x, {x}) = 0. The 
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Levy kernel 0(y, dx) for X t is defined in a similar manner. By duality, v{x, dy)m{dx) 
i>(y, dx)m(dy). 

As an application of (2.3) we consider the martingale 



t-> V f(s,X s _,X s )- [ [ f(s,X s _,z)v(X s 

-f^wi JO JX 



dz)ds, 



se[o,t] 

where f(s,y,z) = 1a(s)1e(?/)1f(<s)- We stop the martingale at td and we see that 

P x {td G rft, X TD _ G dy, X TD G dz) = dt T t D (x, dy)u(y, dz), (2.5) 

on (0, oo) x D x (X\D). A similar result was first proved in [49]. For this reason we refer 
to (2.5) as the Ikeda-Watanabe formula (see also (2.12) and (2.6) below). Integrating 
(2.5) against dt and dy we obtain 

P x (X TD _^X TD ,X TD eE)= [ G D (x,dy)u(y,E), xeD,EcX\D. (2.6) 

Jd 

For x G X and < r < R, we consider the open and closed balls B(x ,r) = 
{x <E X : d(xo,x) < r} and B(xq,t) = {x G X : d(xo,x) < r}, and the annular regions 
A(x , r, R) = {x £ X : r < d(x , x) < R} and A(x , r,R) = {x G X : r < d(x , a?) < i?}. 
Note that 5(x ,t), the closure of B(x ,r), may be a proper subset of S(a; ,r). 

Recall that i?o denotes the localization radius of X. The following assumption is our 
main condition for the boundary Harnack inequality. It asserts a relative constancy of 
the density of the Levy kernel. This is a natural condition, as seen in Example 5.14. 

Assumption C. The Levy kernels of the processes X t and X t have the form v{x, y)m{dy) 
and P(x,y)m(dy) respectively, where v{x,y) = v(y,x) > for all x, y G X, x ^ y. For 
every x <EX, 0<r<R< R , x G B(x , r) and y G X \ B(x , R), 

c p.7) u ( x o, V) < v{x, V) < C(2.7)^(«o, y), C(2. 7 )H x o, y) < u(x, y) < c (2 .7)Z>(a;o, y), (2.7) 

with c (2 .7) = C( 2 . 7 )(x ,r, J R). 

It follows directly from Assumption C that for xq G X and < r < R, 

C(2.$)(xo,r,R) = _inf min(v(x ,y), u(x ,y)) > (2.8) 

y&A{x ,r,R) 

where A(x , r, R) = {x G X : r < <i(x , x) < i?}. (Here we do not require that R < R .) 
Indeed, we may cover A(xo,r, R) by a finite family of balls B(yi,r/2), where yi G 
A(x ,r, R). For y G B(yi,r/2), z/(x ,y) is comparable with i/(x ,?/i), and u(x 0l y) is 
comparable with z>(xo, - 

Proposition 2.1. If xq £ X and < r < i? , t/ien 

C(2.9)(x ,r) = sup m&x(E x T B ( XOtr ),E x f B (x ,r)) < oo. (2.9) 

Proo/. Let i? = 5(a;o,r), R G (r,i? ), x,y e B and F(t) = P x (r B > t). By the definition 
of R , m{X\B(x ,R)) > 0. This and (2.7) yield v(y,X\B) > u(y, X \ B(x , R)) > 
c~7 27 sI>(xq, X\B(x , R)) = c, where c = c(x , r, -R). By the Ikeda-Watanabe formula (2.5), 

= P B (r B G dt) > P x (r B G dt, A TB _ £ X TB ,X TB G X \ £?) 
^ ' dt dt 

= [ u(y,X\B)T t B (x,dy)>c [ T t B \x , dy) = cF (t) . 

./£ Jx 

Hence P x (r B > t) < e~ c *. If follows that E x r B < 1/c. Similarly, E^fe < 1/c. □ 
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In particular, if < R < Rq and D C B(xo,R), then the Green function Gd(x,u) 
exists (see the discussion following Assumption A), and for each x G X it is finite for all 
y in X less a polar set. We need to assume slightly more. The following condition may 
be viewed weak version of Harnack's inequality. 

Assumption D. If xo G X, < r < p < R < Rq and B = B(xq, R), then 

C(2.w)(,x ,r,p, R) = sup sup max (G B (x, y), G B (x, y)) < oo. (2.10) 

xeB(x ,r) y&X\B(x ,p) 

Assumptions A, B, C and D are tacitly assumed throughout the entire paper. We recall 
them explicitly only in the statements of BHI and local maximum estimate. 

When saying that a statement holds for almost every point of X, we refer to the measure 
m. The following technical result is a simple generalization of [18, Proposition II. 3. 2]. 

Proposition 2.2. Suppose that Y t is a standard Markov process such that for every x G X 
and a > 0, the a-potential kernel V a (x,dy) ofY t is absolutely continuous with respect to 
midy). Suppose that function f is excessive for the transition semigroup of Y t , and f 
is not identically infinite. If function g is continuous and f(x) < g(x) for almost every 
x G B(x ,r), then f\x) < g{x) for every x G B(x ,r). 

Proof. Let A = {x G B(xo,r) : f(x) > g{x)}. Then m(A) = 0, so that A is of zero 
potential for Y. Hence B(xo,r) \ A is finely dense in B(xo,r). Since / — g is finely 
continuous, we have f(x) < g(x) for all x G B(x ,r), as desired. (See e.g. [18, 36] for the 
notion of fine topology and fine continuity of excessive functions.) □ 

If X t is transient, (2.10) often holds even when Gb is replaced by G% = U. In the 
recurrent case, we can use estimates of U a , as follows. 

Proposition 2.3. If x e X, < r < p < R < R , a > 0, 

ci(x , r, p, a) = sup sup ma,x(U a (x, y), U a (x, y)) < oo, 

x£B(x ,r) y£X\B(x ,p) 

and T t (x,dy) < C2(t)m(dy) for all x,y G X, t > 0, then in (2.10) we may let 
C(2.io)(x ,r,p,R) = jnf Q (e a *Ci(x , r,p, a) + c 2 (t)c {2 .9)(xo, R)) ■ 

Proof. Denote B = B(x , R). If x G B(x , r), t > and E C B \ B(x ,p), then 

POO 

G B l E (x)= / T t B l E (x)dt 



JO 

rto roo 

<e at0 e- at T t B l E (x)dt+ Tf(T t B Q l E )(x)ds 
Jo Jo 

/■oo / \ POO 

< e at0 / e~ at T t l E (x)dt + su P T t B Q l E (y) / T B l(y)ds 

Jo \yeB J Jo 

< e at0 U a l E (x) + (supT to l E (y) j G B l(x) 

\yeB J 

< (e at ° Cl + c 2 G B l(x))\E\, 

where c\ = ci(xo,r,p,a) and c 2 = c 2 (t ). If y G B \ B(xo,p), then by Proposition 2.2, 
G B (x,y) < e ato d + c 2 G B l{x). By Proposition 2.1, G B l(x) = E x t b < c {2 . 9) (x , R). The 
estimate of G B (x,y) is similar. □ 
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We use the standard notation E X (Z;E) = E x (Z1e)- Recall that all functions / on 1 
are automatically extended to IU {d} by letting f(d) = 0. In particular, we understand 
that Af(d) = for all / G V(A), and E x Af{X T ) = E x {Af{X T ); t < (). 

The following formula obtained by Dynkin (see [39, formula (5.8)]) plays an important 
role. If t is a Markov time, E x r < oo and / G T>(A), then 

EJ(X T ) = f(x) + E, / Af(X t )dt, x G X. (2.11) 

Jo 

If / G V(A) is supported in X \ D and X t G D P y -a.s. for t < r and x£l, then 
E x f(X T ) = E x [ ( [ v{X t ,y)f(y)m{dy)) dt 

(2.12) 



E X (J u{X t ,y)dtj f(y)m{dy). 



Recall that a function f > on X is regular harmonic in an open set D C X if 
/(x) = E a ,/(X(r£ ) )) for all x G X. By the strong Markov property we then have 
f(x) = E x f(X(ru)) for all open sets U C D. Accordingly, we call / > regular 
subharmonic in D (for X t ), if f(x) < E x f(X(Tjj)) for all open sets U C D and all x E U. 
We like to recall that / > is called harmonic in if f(x) = E x f(X(Tjj)) for all open 
and bounded U such that U C D, and all x £ U. This condition is satisfied, e.g., by the 
Green function Gr,(-,y) in D \ {y}, and it is weaker than regular harmonicity. In this 
work however, only the notion of regular harmonicity is used. 

3. Boundary Harnack inequality 

Recall that Assumptions A, B, C and D are in force throughout the entire paper. Some 
results, however, hold in greater generality. For example, the following Lemma 3.1 relies 
solely on Assumption B and (2.9), and it remains true also when X t is a diffusion process. 
Also, Lemma 3.2 and Corollary 3.3 require Assumptions B and C but not A or D. 

Lemma 3.1. If xq G X and 0<r<R<R< oo, then for all D C B(xo, R) we have 

P X (X TD G A{x , R, R)) < c {zs) E x t d , x G B(x , r) n D, (3.1) 

where c (3 .i) = c^.i)(x ,r, R, R) = ixd f>A g(A(x ,R,R),A(x ,r,r)). 

Proof. We fix an auxiliary number r > R and x G B(xo,r). Let / G T> be a bump 
function from Assumption B for the compact set A(xq, R, R) and the open set A(xo, r, f). 
Thus, / G V(A), f(x) = 0, f(y) = 1 for y G A(x , R, R) and < f(y) < 1 for all y G X. 
By Dynkin 's formula (2.11) we have 

P X (X TD G X \ B(x , R)) < E x (f(X TD )) - f(x) = G D (Af)(x) < G D l(x) sup Af(y). 

Since Gr,\{x) = E x t d , the proof is complete. □ 

We write / ~ eg if c~ x g < f < eg. We will now clarify the relation between BHI and 
local supremum estimate. 

Lemma 3.2. The following conditions are equivalent: 

(a) If xo G X ; < r < R < R , D C B(xq, R) is open, f is nonnegative, regular 
harmonic in D and vanishes in B(xq, R)\D, then 



f(x) < C( 3 .ii) / f(y)p(x ,y)m(dy) (3.2) 

>X\B(xo,r) 
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for x G B(xo, r) fl D, where C( 3 .n) = C( 3 .n)(xo, r, R). 
(b) If Xq G X, < r < p < q < R < R , D C B(x ,R) is open, f is nonnegative, 
regular harmonic in D and vanishes in B(xq, R)\D, then 

f(x) ~ c {3 . 3 )E x (T DnB (x ,p)) / f(y)u(x ,y)m(dy) (3.3) 

JX\B(xo,q) 

forx G B(x , r) fl -D, w/iere c (3 . 3) = C( 3 .3)(x , r,p, g, R). 
In fact, if (a) holds, then we may let 

C(3.3)(x ,r,p,q,R) = c {xl) (x Q} r,p,q)c {xll) (x ,q } R) + c (2 .7)(x ,p, q), 

and if (b) holds, then we may let 

C(3.u)(xo,r,R) = inf c (3 . 3) (x ,r,p, q, R)c {2 .9)(x , R). 

r<p<q<R 

Proof. Since X\B(x ,q) C X\ J B(a; ,r) and E z (r DnB(;EOi p ) ) < E x (t b(x0iR) ) < C( 2 .9)(x , 
we see that (b) implies (a) with C( 3 .n) = C( 3 3 )(x , r, p, q, i?)c( 2 .9)(x , -R). Below we prove 
the converse. Let (a) hold, and U = D fl B(x ,p). We have 

/(x) = E x (/(X Tl7 );X Tl7 G S(x ,g)) +E x (/(X Tt ,);X n , el\%?)). (3.4) 

Denote the terms on the right hand side by I and J, respectively. By (3.1) and (3.2), 

0<I<P x (X Tu EA(x ,p,q)) sup f{y) 

yeB(x ,q) 

f (3-5) 
< C(3.i)C( 3 .ii)E a .T l 7 / f(y)v{x ,y)m{dy), 

JX\B(x ,q) 

with C( 3 .i)(x , r,p, g) and C( 3 .n)(x , -R). For J, the Ikeda-Watanabe formula (2.12) yields 



J= I \ I G u(x 1 z)iy(z,y)f(y)m(dz) ) m(dy) 

JX\B(x ,q) 



w c(2.7) / _ / Gu(x,z)u(x ,y)f(y)m(dz))m(dy) (3.6) 

JX\B(x ,q) \JU J 

= c {2 .7)E x T U / u(x ,y)f(y)m(dy), 

JX\B(x ,q) 

with constant C( 2 .7)(x ,p, Formula (3.3) follows, as we have C( 3 .i)C( 3 .n) + C( 2 .7) in the 
upper bound and l/c( 2 .7) in the lower bound. □ 

We like to remark that BHI boils down to the approximate factorization (3.3) of f(x) = 
P x (X(t d ) G E). We also note that P x (X(t d ) G E) w v{x q , E)E x t d , if E is far from 
B(xq, R), since then u(z, E) w ^(x , -E) in (2.6). However, 1/(2, i?) in (2.6) is quite singular 
and much larger than u(xq, E) if both z and E are close to dB(xo, R). Our main task is to 
prove that the contribution to (2.6) from such points z is compensated by the relatively 
small time spent there by when starting at x G D. In fact, we wish to control (2.6) 
by an integral free from singularities (i.e. (3.2)), if x and E are not too close. 

By substituting (3.3) into (1.1), we obtain the following result. 

Corollary 3.3. The conditions (a), (b) of Lemma 3.2 imply (BHI) with 
C(i.i)(x ,r,R) = inf (c {3 . 3) (x , r,p, q, R)) A . □ 

r<p<q<R 

The main technical result of the paper is the following local supremum estimate for sub- 
harmonic functions, which is of independent interest. The result is proved in Section 4. 
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Theorem 3.4. Suppose that Assumptions A, B, C and D hold true. Let xq G X and 
0<r<q<R< R , where Rq is the localization radius from Assumptions C and D. Let 
function f be nonnegative on X and subharmonic with respect to X t in B(xq,R). Then 

f{y)^xo,r,q,R{y)m{dy), xeB(x ,r), (3.7) 

X\B(x ,q) 




C(3. 9 )5 for y G B(x , R) \ B(x , q), 

2c (3 .9) min(5, u(y, B(x , R))) for y Gl\ B(x , R), 



(3i 



5 = q(B(xq, q), B(xo, R)) (see Assumption B), and 

l B\ ■ t f l T3\A C(2.9)(XO,R)(c (2 .7)(Xo,p,q)) 2 \ 

c {3 . 9) (x ,r,q,R) = (c (2 . 10) (x ,r,p,R) + m(B(xo>p)) ) " ( 3 ' 9 ) 

Theorem 3.4 (to be proved in the next section) and Corollary 3.3 lead to BHI. We note 
that no regularity of the open set D is assumed. 

Theorem 3.5. If assumptions A, B, C and D are satisfied, then (BHI) holds true with 
C(i.i)(x Q ,r,R) = inf {g(A(x ,p, q), A(x , r, f))c (3 .ii)(a;o, Q, R) + C( 2 .7)(x ,P, q) ) , (3.10) 

p,g,r v ' 
r<p<q<R<f 

C(3.u)(xo,Q,R) = inf 2c { 3.9 ) (xo,g,g, J R)x 

q,R 

q<q<R<R 

, Q(B(x ,q),B(x ,R)) ~ N ( ' J '' j ) 

xmax r -^- ,c (2 . 7) {x ,R,R)m(B(x ,R)) 

C(2.8){X0, q, R) 

Proof. We only need to prove condition (a) of Lemma 3.2. Let q, R satisfy r < q < R < R. 
By (3.7) and (3.8) of Theorem 3.4, it suffices to prove that TT Xo ,r,q,R{y) < c (3. ii)v{xq, y)- 
For y G A(x , q, R) we have 

n X0 , r , q ,R{y) < 2c(3. 9 )5 < 2C(3 ' 9) v(x ,y), 

C(2.8) 

with C(2.s) = C(2.8)(^o, q,R)- If V e X \ B(x , R), then 

n Xo ,r,q,R(y) < 2 C(3.9)v(y,B(xo,R)) < 2c {3 . 9) c {2 . 7) m(B (x , R))p(x , y) , 

with C( 2 .7) = C(2.7)(x , R, R). The proof is complete. □ 

Remark 3.6. (BHI) is said to be scale-invariant if C(i.i) may be so chosen to depend on 
r and R only through the ratio r/R. In some applications, the property plays a crucial 
role, see, e.g., [14, 26]. If X t admits stable-like scaling, then C(i.x) given by (3.10) is 
scale-invariant indeed, as explained in Section 5 (see Theorem 5.4). 

Remark 3.7. The constant cn.n in Theorem 3.5 depends only on basic characteristics 
of X t . Accordingly, in Section 5 it is shown that BHI is stable under small perturbations. 

Remark 3.8. BHI applies in particular to hitting probabilities: if < r < R < Rq, 

x, y G B(xq, r) fl D and Ex, E 2 C X \ B(xq, R), then 

P X {X TD G Ei)V y {X TD G E 2 ) < c {1 . 1} P y {X TD G E 1 )P X (X TD G E 2 ). 
Remark 3.9. BHI implies the usual Harnack inequality if, e.g., constants are harmonic. 
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The approach to BHI via approximate factorization was applied to isotropic stable 
processes in [26], to stable-like subordinate diffusion on the Sierpihski gasket in [53], and 
to a wide class of isotropic Levy processes in [63]. In all these papers, the taming of the 
intensity of jumps near the boundary was a crucial step. This parallels the connection of 
the Carleson estimate and BHI in the classical potential theory, see Section 1. 

4. REGULARIZATION OF THE EXIT DISTRIBUTION 

In this section we prove Theorem 3.4. The proof is rather technical, so we begin with 
a few words of introduction and an intuitive description of the idea of the proof. 

In [26, Lemma 6], an analogue of Theorem 3.4 was obtained for the isotropic a-stable 
Levy processes by averaging harmonic measure of the ball against the variable radius of 
the ball. The procedure yields a kernel with no singularities and a mean value property 
for harmonic functions. In the setting of [26] the boundedness of the kernel follows from 
the explicit formula and bounds for the harmonic measure of a ball. A similar argument 
is classical for harmonic functions of the Laplacian and the Brownian motion. For more 
general processes X t this approach is problematic: while the Ikeda-Watanabe formula 
gives precise bounds for the harmonic measure far from the ball, satisfactory estimates 
near the boundary of the ball require exact decay rate of the Green function, which is 
generally unavailable. In fact, resolved cases indicate that sharp estimates of the Green 
function are equivalent to BHI ([20]), hence not easier to obtain. Below we use a different 
method to mollify the harmonic measure. 

Recall that the harmonic measure of B is the distribution of X(t#). It may be inter- 
preted as the mass lost by a particle moving along the trajectory of X t , when it is killed 
at the moment tb- In the present paper we let the particle lose the mass gradually before 
time tb, with intensity ip{X t ) for a suitable function ip > sharply increasing at dB. 
The resulting distribution of the lost mass defines a kernel with a mean value property 
for harmonic functions, and it is less singular than the distribution of X(tb). 

Throughout this section, we fix x G X and four numbers < r < p < q < R < R , 
where Ro is defined in Assumptions C and D. For the compact set B(xo, q) and the open 
set B(xo,R) we consider the bump function ip provided by Assumption B. We let 

5 = sup ma,x(Af(x), A<p(x)), (4.1) 

and 

V = {x G X : <p(x) > 0}. (4.2) 
We have V C B(x , R), see Figure 1. By Assumption B, m(dV) = 0. Note that A<p(x) < 
and A<p(x) < if x G B(x , q), and S can be arbitrarily close to g(B(x , q), B(x , R)). 

We consider a function i/i : X U {d} — > [0, oo] continuous in the extended sense and 
such that ip(x) = oo for x G (X \ V) U {d}, and if)(x) < oo when x G V. Let 

ft+E 

A t = lim / ip{X s )ds, t > 0. (4.3) 

£ V Jo 

We see that A t is a right-continuous, strong Markov, nonnegative (possibly infinite) 
additive functional, and A t = oo for t > (. We denote by M t the right-continuous 
multiplicative functional 

M t = e~ At . 

For a G [0,oo], we let r a be the first time when A t > a. In particular, r M is the time 
when At becomes infinite. Note that A t and M t are continuous except perhaps at the 



12 



KRZYSZTOF BOGDAN, TAKASHI KUMAGAI, AND MATEUSZ KWASNICKI 




single (random) moment r M when A t becomes infinite and the left limit Aij^— ) is finite. 
Since A t is finite for t < ry, we have > ry. If ip grows sufficiently fast near dV, then 
in fact = Ty, as we shall see momentarily. 

Lemma 4.1. 7/c 1 ,c 2 > are such that ip(x) > Ci(<^(a;)) -1 — c 2 for all x G V, then 
A(tv) = oo and M(ry) = P x -a.s. for every x G X. In particular, Ty = r^. 

Proof. We first assume that x G X\ V\ In this case it suffices to prove that Aq = oo. Since 
<A(p(y) < 5 for all ?/ G X, and ^(x) = 0, from Dynkin's formula for the (deterministic) 
time s it follows that E x ((p(X s )) < 5s for all s > 0. By the Schwarz inequality, 



where < e < t. Here we use the conventions 1/0 = oo and • oo = oo. Thus, 



E,( I'-^dsY 1 <( t'-dsY^J t^ds 



<P( X s) J \Je S J \J £ S 



..2 



< { I -ds] [ -ds <] 



s J J £ s \og{t/e) 



with the convention l/oo = 0. Hence, 



1 



Ex ' A t + c 2 t ' ~ Ex ' ' ^ Xs) + C2)dS 



< E , [ / ds] < d 



<p(X 8 ) ) ~ ci log(t/e) 



By taking e \ 0, we obtain 



(4.4) 



It follows that At = oo P x -a.s. We conclude that A = oo and M = P desired. 



BOUNDARY HARNACK INEQUALITY FOR MARKOV PROCESSES WITH JUMPS 13 

When x G V, the result in the statement of the lemma follows from the strong Markov 
property. If M(ry— ) denotes the left limit of M t at t = ry, then 

E X M TV = E x (M Tv .E x{Tv) (M )) = 0, 

by the first part of the proof,. The lemma is proved. □ 

From now on we only consider the case when the assumptions of Lemma 4.1 are sat- 
isfied, and ci, C2 are reserved for the constants in the condition ip(x) > ci(v?(a;)) _1 — c 2 . 
By the definition and right-continuity of paths of X t , A t and M t are monotone right- 
differentiable continuous functions of t on [0, Ty), with derivatives ip(Xt) and —tp(X t )M t , 
respectively. 

Let £«(•) be the Dirac measure at a. Lemma 4.1 yields the following result. 
Corollary 4.2. We have —dM t = if)(X t )M t dt + M{ry—)e Tv (dt) P x -a.s. In particular, 

-E x [ f(X t )dM t = E X ( [ T f{X t )i){X t )M t dt) + E x (M TV _f(X TV ); r > r v ) (4.5) 
J[o,t) \Jo J 

for any measurable random time t and nonnegative or bounded function f. □ 

We emphasize that if M t has a jump at r, in which case we must have r = Ty, then 
the jump does not contribute to the Lebesgue-Stieltjes integral _/j 0r ^ f(X t )dM t in (4.5). 
The same remark applies to (4.6) below. 

Recall that r a = inf {t > : A t > a}. Note that r a are Markov times for X t , a h-> r a is 
the left-continuous inverse of t \-> A t , and the events {t < r a } and {A t < a} are equal. 
We have A{r a ) = a unless r a = Ty, and, clearly, r a < = Ty- 

The following may be considered as an extension of Dynkin's formula. 

Lemma 4.3. For f e T>{A), Markov time r, and x G V, we have 

E x [ T Af(X t )M t dt = E x (f(X T )M T _) - f(x) - E x [ f(X t )dM t . (4.6) 

JO J[0,t) 

If g = (A — ip)f an d t < Ty , then 

E x [ g(X t )M t dt = E x (f(X T )M T _) - f(x). (4.7) 
Jo 

In fact, (4.6) holds for every strong Markov right-continuous multiplicative functional M t . 
Proof. Since e~ a da = M t and {t < r a } = {A t < a}, by Fubini, 

Af(X t )M t dt = E x Af{X t ) (J l {0iTa) (t)e- a daj dt 

/do / /•min(r,r a ) \ 

{E x j Af(X t )dtje- a da. 



Since min(r, r a ) is a Markov time for X t , we can apply Dynkin's formula. It follows that 

/•min(r,T a ) 

/ Af(X t )M t dt = E x (/(X min(TiTa) )) - f(x). 

Jo 
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By Fubini and the substitution r a = t, a = A t , e a = M t , 
E 



pT pOO 

/ Af(X t )M t dt = / (E x (f(X min(T , Ta) )) - f(x)) e~ a da 
Jo Jo 

= E X ( [ f(X mhl{T)Ta) )e~ a da) - f(x) 



o 



-E x ( [ f(X mhl{T;t) )dM t ) - f{x). 

\J\0,oo) J 



'[0,oo) 

We emphasize that the last equality holds true also if r = ry with positive probability. 
We see that (4.6) holds. By (4.5) we obtain (4.7). □ 

The functional M t is a Feynman-Kac functional, interpreted as the diminishing mass 
of a particle started at x G 3£. We shall estimate the kernel n^,ix,dy), defined as the 
expected amount of mass left by the particle at dy. Namely, for any nonnegative or 
bounded / we define 



n^fix) = -E x / fiX t )dM t , xeX. (4.8) 

J [0,00) 

Note that n^fix) = fix) for x e X \ V. By the substitution r a = t, a = A t , e~ a = M t 
and Fubini, we obtain that 

(POO \ POO 

J fiX Ta )e- a da\ = E x ifiX Ta ))e~ a da. (4.9) 

The potential kernel G^ix, dy) of the functional M t will play an important role. Namely, 
for any nonnegative or bounded / we let 

/OO POO / PT a \ 

fiX t )M t dt = E x j^ I J fiX t )dt\e- a da. (4.10) 

In the second equality above, the identities M t = e~ a da and {t < r a } = {A t < a} 
were used together with Fubini, as in the proof of Lemma 4.3. We note that G^ix,dy) 
measures the expected time spent by the process X t at dy, weighted by the decreasing 
mass of X t (compare with the similar role of Gvix,y)midy)). There is a semigroup 
of operators Tf fix) = E x ifiX t )M t ) associated with the multiplicative functional M t . 
Furthermore, Tf are transition operators of a Markov process Xf, the subprocess of X t 
corresponding to M t . With the definitions of [18], M t is a strong Markov right-continuous 
multiplicative functional and V is the set of permanent points for M t . Therefore, Xf is a 
standard Markov process with state space V, see [18, III. 3. 12, III. 3. 13 and the discussion 
after III. 3. 17]. (From (4.4) and [18, Proposition III. 5. 9] it follows that M t is an exact 
multiplicative functional. Furthermore, since M t can be discontinuous only at t = ry, 
the functional M t is quasi-left continuous in the sense of [18, III.3.14], and therefore Xf 
is a Hunt process on V. However, we do not use these properties in our development.) 

Informally, Xf is obtained from X t by terminating the paths of X t with rate ipiX t )dt, 
and n^ix,dy) is the distribution of X t stopped at the time when Xf is killed. Further- 
more, G^pix,dy) is the potential kernel of Xf . To avoid technical difficulties related to 
subprocesses and the domains of their generators, in what follows we rely mostly on the 
formalism of additive and multiplicative functionals. 

The dual multiplicative functional M t is defined just as M t , but for the dual process X t . 
We correspondingly define and G^. The subprocess Xf of X t corresponding to the 
multiplicative functional M t is the dual process of Xf; see [36, 13.6 and Remark 13.26]. 
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Hence, the potential kernel of Xf admits a uniquely determined density function 
G^(x, y) (x, y G V), which is excessive in x with respect to the transition semigroup if of 
Xf, and excessive in y with respect to the transition semigroup T t of Xf . Furthermore, 
G^(x,y) = G^(y,x) is the density of the potential kernel of Xf. Since G^(x,dy) is 
concentrated on V, we let G^(x, y) = OifxEX\Voiy&X\V. Clearly, G^(x, dy) is 
dominated by Gy{x, dy) for all x G V, and therefore 

G^(x, y) < G v (x, y), x, y G X. 

There are important relations between tt^, ip and A. If / is nonnegative or bounded 
and vanishes in X \ V, then by Corollary 4.2 we have 

^f(x) = G^f)(x), xeV. (4.11) 

Considering r = Ty, we note that M(ry) = 0, and so for bounded or nonnegative / 

/ f(X t )dM t = [ f(X t )dM t - f(X rv )M TV _. 

If / G T>(A), then formula (4.6) gives 

G lp Af(x)=ir i ,f(x)-f(x), xeV. (4.12) 

Furthermore, by (4.7), for / G T>(A) we have 

G f (A - = E x (/(X rv )M rv _) - /(x), xeV. 

In particular, if / G £>(vA) vanishes outside of V, then we have 

G^(.A - = -/(x), x £ V (4.13) 

(which also follows directly from (4.11) and (4.12)). Formula (4.13) means that the 
generator of Xf agrees with A — ip on the intersection of the respective domains. 

We now introduce the Green operators Gy and harmonic measures nfj for Xf . Let U 
be an open subset of V. For nonnegative or bounded / and x G V we let 

7r#/(x) = E X (/(X T JM TC ,_), Gg/(x) = E x H f{X t )M t dt. 

Jo 

We note that Gyf = G^f. Also, n v f = 7ty/, if / vanishes in V. Furthermore, Gfj admits 
a density function Gy(x,y), and we have Gfj(x,y) < Gjj(x,y), Gfj{x,y) < G^{x,y). If 
/ vanishes outside of V, then we can replace M(tjj—) by M(tjj) in the definition of 7rf. 
By (4.7), for any / G T>(A) we have 

4 fix) = G*(A - ^)f(x) + f(x), xeV. (4.14) 

In particular, by an approximation argument, 

4(x,E)= [ G+{x,y)v{y,E)m(dy), xeU,ECX\U. (4.15) 

Ju 

Formulas (4.14) and (4.15) can be viewed correspondingly as Dynkin's formula applied 
to the first exit time, and the Ikeda-Watanabe formula for Xf . 

Recall that xo & X, 0<r<p<q<R< R , B(xq, q) C V C B(xq, R), see Figure 1, 
if G T> is positive in V and vanishes in X \ V, and <f(x) = 1 for x G B(xq, q). 

Lemma 4.4. Let U = V\ B(x Q , q) . If (A - ip)(p(x) < for x G V, then 

n$(x,V\U)<<p(x), xeU. (4.16) 
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Proof. By (4.14), for x G U we have 

%t<p(x) - <f(x) = Gfj(A - i/))<p(x) < 0. 
It remains to note that (p — 1 on V \ U. □ 

Essentially, we use here (and later on) superharmonicity of tp with respect to A — ip. 
Lemma 4.5. If (A — ip)tp(x) < for x G V , then 

G${x,y) < C(4.i 7 )^(x), x G V \ B(x , p), y G B(x , r), (4.17) 

with C(4.i 7) = C(4.i 7 )(x ,r,p,g, J R). 

Proof. Let U — V \ B(xo,q) and x G U. Let / be a nonnegative function supported in 
B(xo,r), J f(y)m(dy) = 1 and g(z) = G^f(z) (this is done to regularize G^(x,y)). Using 
the definition of G^, the relation f(X t ) = for t < tu and the strong Markov property, 
we obtain that 

g(x) = E x QT f(X t )M t dtj = EMX^M^) = 4g{x). 

We split the last expectation into two parts, corresponding to the events X(tu) G B(xo,p) 
and X{tjj) G A(x ,p,q) respectively. By (2.10) and the inequality M(r (/ ) < 1, we have 
g(z) < C( 2 .io)( a; o, r , Pi R) for z G A(xo,p,q). From (4.16) it follows that 

Ku(.9~*-A( X0 , p , q )){x) < C(2.io)7rJ(x, 5(x , g)) < c {2 . w) <p(x). (4.18) 
For the other part, we use (4.15) and (2.7), 



^u{9^B{ X0 ,p)){x) = I I g(z)u(y,z)m(dz) ) Gfj(x } y)m(dy) 

JU \JB(xo,p) 



< C( 2 .7) / v(y,x )G{ r (x,y)m(dy) ■ / g(z)m(dz) } 

JU JB(x ,p) 

with constant C(2.7)(xo,p, q). Using again (2.7) and (4.15), and then (4.16), we obtain 

u(y,x )Gf r (x,y)m{dy) < ^f 7) / v{y,B{x Q ,p))Gi{x,y)m{dy) 
u rn{-D[Xo,P)) Ju 



m(B(x ,p)) • - ■ • m(B(x ,p)) 
By (2.9), we have 



g(z)m(dz) < / / G v (z,y)m(dz) ) f(y)m(dy) 

B(x ,p) JB(xo,r) \Jb(x ,p) 



< / E y (f v )f(y)m(dy) < c (2 . 9 ), 

JB(x ,r) 

with constant C( 2 .g)(xo, R). Hence, 



i>( , V N / (C(2.7)) 2 C(2.9)^(^) 



m(B(x ,p)) 

This and (4.18) yield that g(x) < c^_n^ip(x) , where 

C(2.9)(x ,R)(c(2.7)(x ,P,q)) 2 



C(4.i7) (xo, r, p, q, R) = c (2 .io) (x , r, p, R) + 



m(B(x ,p)) 
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Recall that g = G^f, where / is an arbitrary nonnegative function vanishing outside 
B(x ,r) with integral equal to 1. Hence, by approximation, for each x G X \ B(x ,q), 
formula (4.17) holds for almost every y G B(x ,r). By Proposition 2.2 (applied to Xf ), 
(4.17) holds for every y G B(x ,r). 

For x G A(x Q ,p, q), the result follows easily from (2.10). Indeed, we have G^(x,y) < 
Gy(x,y) < C( 2 .io) = C( 2 .io)V 9 ( a; ); with constant c^.w)(x , r, p, R). Hence, formula (4.17) 
holds also for x G A(xo,p, q), with the same constant. □ 

The above arguments can be repeated for the dual process X t . Hence, the dual versions 
of Lemmas 4.4 and 4.5 hold true, with the same c^.ii)- 

We are very close to the estimate of ir^(x,dy) for x G B(x ,r). Indeed, for y G V 
we have tt^(x, dy) = G^(x,y)ip(y)m(dy) (see (4.11)). When y G X \ V, then, at least 
heuristically, 7i^(x,dy) = AG°^(y)m(dy) , where G^(y) = G^,(x,y) vanishes outside of V 
(see (4.12)). This will give satisfactory bounds when y G X \ V. Before we proceed, we 
first show that tt^(x, dV) = 0. 

Lemma 4.6. Suppose that for some 0^,04 > 0, we have ip(x) > c 3 + (ip(x))~ l A(p(x) and 
ijj(x) < C4/ip(x) for x G V. Then for every nonnegative function f we have 



c 3 / Tty / (x)^p(x)m(dx) < c 4 / f(x)m(dx) + / f(x)Aip(x)m(dx). (4.19) 
Jv Jv Jx\v 

Proof. First, suppose that / G V(A). Denote h{x) = —(A — ip)ip(x) for x G V. Note 
that h is nonnegative. Let g(x) = Ti^f(x) for x G X; hence g(x) = f(x) for x G X \ V, 
see (4.8). By (4.12), we have g(x) = f(x) + G^Af(x) for x G V. Hence, 



g(x)h(x)m(dx) = / f(x)h(x)m(dx) + / G^Af (x)h(x)m(dx) . 
Jv Jv 

For the second term, we have 

G^Af{x)h{x)m(dx) = / Af (x)G,ph(x)m(dx) . 

Jv 

By (4.13) (dual version), G^h{x) = — G^(A — x/))(p(x) = <p(x) for x G V. Hence, 

G^pAf (x)h(x)m(dx) = / Af(x)ip(x)m(dx) = / f(x)A<p(x)m(dx). 

Jv Jx 

In the last equality, we used the fact that <p(x) = for x G X \ V. It follows that 

g(x)h(x)m(dx) = / f(x)h(x)m(dx) + / f(x)Aip(x)m(dx). 
Jv Jx 

But h(x) = —(A — ip)(p(x), so that finally, after simplification, 



g(x)h(x)m(dx) = / f (x)(p(x)^(x)m(dx) + / f (x)A(p(x)m(dx) . 
Jv Jx\v 

Using the inequalities ip{x)ip{x) < C4 for x G V and h(x) = i/;(x)ip(x) — A<p(x) > c^ip{x) 
for x G V, we obtain (4.19). The general case of nonnegative / (not necessarily in T>(A)) 
follows by approximation. □ 

Lemma 4.7. Suppose that for some 03,04 > ; we have if)(x) > C3 + (ip(x))~ l Aip(x) and 
ip(x) < Ci/ip^x) for x G V . Then M{jy— )lgv(X(rv)) = P x -a.s. and ir^(x,dV) = 
for all x G V. 
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Proof. For x G V define g{x) = 7r^(x,dV). By (4.19), f v g(x)ip(x)m(dx) = 0, so that g 
vanishes almost everywhere in V. We claim that g is excessive for the transition semigroup 
Tf of Xf. Indeed, we have g(x) = E x (M(t v -); X(t v ) G dV), so that by the Markov 
property, for any t > and x G V, 

E x (M t g(X t )) = E x (M t g(X t );t < r v ) = E x {M Ty ^X Tv G dV, t < r v ). 

The right-hand side does not exceed g(x), and by monotone convergence, it converges to 
g(x) as t \, 0. Hence g is an excessive function equal to zero almost everywhere in V. 
By [18], Proposition II. 3. 2 (or by Proposition 2.2), g{x) = for all x &V. □ 

Recall that according to the remark following Lemma 4.1, we keep assuming that 
if)(x) > Ci(v9(x)) _1 — c 2 for x G V. Consider if)(x) = c± 1 5(ip(x) + c 2) + C3 for some c 3 > 0, 
and let M t be the multiplicative functional defined in a similar manner as M t , but with 
ip replaced by if). Clearly, for all t > we have M t = if and only if M t = 0. Since 
if>(x) > C3 + 5/<p(x), an application of Lemma 4.7 to if) yields the following result. 

Corollary 4.8. Suppose that for some c > 0, we have if)(x) < c/ip(x) for x G V. Then 
M{jy— )1qv(X(tv)) = P x -a.s. for x G V. In particular, 7r^(x,dV) = for x G V. □ 

Now we make the actual choice of if>. 

Lemma 4.9. Let S be given by (4.1), and 

XGXU{9}, (4.20) 

where 1/0 = 00. For all x G B(xo,r) we have n^(x,dy) < 7r^(y)m(dy) , where 

= C(4.u) (51v\B(x , q )(y) + 2 min(5, u(y, V))l x \v(y)) (4.21) 
with C(4.i7) = C(4.i7)(xo, r, p, q, R) given in Lemma 4-5. 

Proof. Note that ip(x) > ^(^(x))- 1 -5, (A- ip)<p(x) < 0, (A - if>)<p(x) < and if)(x) < 
8/tp(x) for x G V. Hence, we may apply Lemmas 4.1, 4.4 and 4.5, Corollary 4.8, and 
their dual versions. By Corollary 4.8, 7r^(x, dV) = for all x G V. Since Atp(x) < 
and Af(x) < for x G B(xo,q), we have ip(x) = for x G B(xo,q), and therefore 
tt^(x, B(xq, q)) = for all x G V. 

Fix x G B(xq, r). If / is nonnegative and vanishes in B(x , q) and in 36 \ V, then (4.11) 
yields that 



7r^/(ar) = G^{if)f){x) = / G^(x,y)ifj(y)f(y)m(dy). 

JV\B(x,q) 

Using (4.17) for G^ and the inequality tp(y)if>(y) < 5 for y G V, we have 



7ty/(a;) < c (4 .i7) / <p(y)i>(y)f(y)™(dy) < C( 4 .i7)5 / f(y)m(dy), (4.22) 

JV\B(x,q) JV\B{x,q) 

with constant 0(4.17) (xq, r , P, 9j -R)- Suppose now that / G T>{A) vanishes in V . By (4.12), 



Tty/fa) = G^^4/ = / G#(x,y) / f(z)v(y,z)m(dz) ) m(dy) 



G^,(x,y)v(y, z)m(dy) f(z)m{dz 
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We estimate the inner integral for z6l\7. Using (4.17) for we have 



Gip(x,y)v(y,z)m(dy) < c (4 .i 7) / (p(y)v(y, z)m(dy) = c (4 . 17) A(p(z) 

V\B(x Q ,p) JV\B(x ,p) 

The integral over B(xo,p) is estimated as in the proof of Lemma 4.5, 



G^(x,y)u(y,z)m(dy) <c^.7)v(x ,z) / G v (x,y)m(dy) 

B(xo,p) J B(xq,p) 

< C i2 .7)v(x , Z)E X T V < C( 2 .7)C(2.9)V{X ,Z) 

, C( 2 .9)(C(2.7)) 2 a / J-./ u . % ( \ 

with constants c^.7)( x o,P, q), c (2.9){%o, R) and c^_i^(x ,r,p,q,R). Since Atp(z) < 5 and 
Aip(z) < v(z,V), we obtain that 



7ty/(x)< 2c (4 .i7) / f(z)A<p(z)m(dz) <2c {4 . 17) f(z)mm(5,u(z,V))m(dz). (4.23) 

By approximation, (4.23) holds for any nonnegative / vanishing in V. Formula (4.21) is 
a combination of (4.22), (4.23), n^(x, dV) = and tt^(x, B(x , q)) = for all x G V. □ 

Lemma 4.10. If a nonnegative function f is regular subharmonic in B(xq,R), then 
f(x) < ir^f(x) for x G B(xo,r). If f is regular harmonic, then equality holds. 

Proof. If / is regular subharmonic in V, then f(x) < E x (/ (X (r ))) for all a G [0,oo] 
(see [18], Proposition II. 2. 8). If / is regular harmonic in V, then equality holds. The 
result follows by (4.9). □ 

The local maximum estimate is now proved as follows. 

Proof of Theorem 3.4- Fix p G (r, q). Choose e > and tp as in the beginning of this 
section, and so that 5 = sup x£X max(Aip(x),Af(x)) < g(B(xo,q),B(xo,R)) + e. Define 
ip as in (4.20). By Lemmas 4.9 and 4.10, we have (3.7) with 7r Xo ,r,q,R(y) bounded from 
above by 7ty(y) defined in (4.21). Note that i>(y, V) < v(y, B(x , R)). Since e > and 
p G (r, q) are arbitrary, formulas (3.8) and (3.9) follow. □ 

We conclude this section with a result on diffusion processes. The above argument 
remains valid when v vanishes everywhere, i.e., X t is a diffusion process. In this case (2.9) 
is not a consequence of Assumption C, so we need to add (2.9) as an assumption. No 
other changes in the argument are needed, and in fact the proof of Lemma 4.5 simplifies 
significantly, since X t exits U through the boundary of U, and therefore X(tjj) is never 
in B(xo,p). Therefore, we have proved the following result. 

Theorem 4.11. Assume that X t is a diffusion process satisfying Assumptions A, B 
and D, and formula (2.9). Let xq G X and 0<r<q<R< Rq, where Ro is the 
localization radius of 2.9 and Assumption D. Let f be a nonnegative function on B(xq, R), 
subharmonic in B(xq,R) with respect to X t . Then 

f(x)< C(4.24) / f{y)m{dy), xeB{x ,r). (4.24) 

JA(x ,q,R) 

Here c (4 . 2 4) = c {4 . 2i )(x ,r, q, R) = c (2 .io)S, where S = g(B(x ,q),B(x ,R)) and c {2 .w) = 
c (2.io)(x ,r,q, R) are defined in Assumptions B and D. 
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Remark 4.12. For diffusion processes, local supremum estimate (4.24) for subharmonic 
functions is typically proved analytically, using Sobolev embeddings and Moser iteration, 
see, e.g., [41]. Theorem 3.4 requires more regularity of the process X t as compared to the 
analytical approach because we assume the existence of bump functions in the domain of 
the Feller generator (Assumption B), while Moser iteration is based on the energy form. 
However, our approach does not depend on Sobolev embeddings, and so it applies also to 
Sierpihski carpets and some other highly irregular state spaces X. It would be interesting 
to find an analytical proof of the local supremum estimate for jump-type processes, which 
would not require Assumption B. Related results have been recently studied when the 
Levy kernel u(x,y) is comparable to (d(x,y))~ d ~ a (see [54] and the references therein). 
Further comments on this subject are given in Example 5.6 and Appendix A. 

5. Extensions and examples 

In this section we study several applications of our boundary Harnack inequality, and 
discuss limitations of Theorem 3.5. We sketch the range of possible applications by 
indicating rather general classes of processes satisfying the assumptions of Theorem 3.5, 
without getting into technical details. Before that, however, we discuss an important 
notion of scale-invariance introduced in Remark 3.6. This property can be proved in a 
fairly general setting, which we call stable-like scaling. 

Definition 5.1. The process X t is said to have stable-like scaling property with dimension 
n > 0, index a > and localization radius Rq G (0, oo] (a-stable-like scaling in short), if 
the following conditions are met: 

(a) X is locally an Ahlfors regular n-space; that is, c~ l r n < m(B(x,r)) < cr n when 
< r < Ro and iGl; 

(b) C( 2 .7)(x , r ) R) < c(r/R) when < r < R < R Q , x G 1 in the relative constancy 
of the Levy measure condition in Assumption C; 

( c ) c (2.8)( a; 05 r ; R) > c{r / R)R~ n ~ a when < r < R < R , x G X, that is, v{x,y) > 
c(d(x,y))~ n ~ a when d(x,y) < Ro; 

(d) C(2.9)(xo, r) < cr a when < r < R , xq G X in the upper bound for mean exit 
time from a ball; 

(e) C(2.io){xq, r,p, R) < c(r /R,p/R)R a ~ n when 0<r<p<R<R and xo G X in 
the off-diagonal upper bound for the Green function of a ball; 

(f) g(B(x ,r),B(x ,R)) < c(r/R)R~ a when < r < R < R and x G X, and 
g(A(xo,p, R), A(xq, r, f)) < c(r /R,p/R, R/f)R~ a when 0<r<p<R<fin 
Assumption B. 

Proposition 5.2. If the scaling property (a) is satisfied, then conditions (b), (c) and (d) 
are consequences of: 

(g) the Levy kernel of X t satisfies 
c~\d(x,y))~ n - a exp(-qd(x,y)) < u(x,y) < c{d{x, y)Y n ~ a exp(-qd{x, y)) 
for some q > and for all x, y G X. 

Note that the same parameter q appears in the lower and the upper bound. 

Proof. Conditions (b) and (c) follow directly from (g). Furthermore, by (a) and the 
triangle inequality, there is Rq > such that if x G X and < r < _R , then for some 
y G B(xo,cir) \ B(x ,r) where c\ > 2, the balls B(x ,r) and B(y,r) are disjoint. Hence, 
for all x G B(x ,r) we have by (a) and (g), v(x,X \ B(x ,r)) > is(x, B(y,r)) > c 2 r~ a . 
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As in the proof of Proposition 2.1, it follows that Pas(TB(a: ,r) > t) — ex P( — c 2 r~ a t), and 
therefore E x (T B ( Xo , r )) < c 2 V a . □ 

We also have the following sufficient condition for scaling properties (d) and (e). 

Proposition 5.3. Assume that scaling property (a) holds. Suppose that the transition 
density T t (x, y) of a Hunt process X t exists, and that for some a > 0, r > 0, 



(5.1; 



for x, y G X with d(x,y) < r^, and any t G (0,Tq). Then Assumption D and scaling 
conditions (d) and (e) hold. The constant c^.w) an d the localization radius Rq in (2.10) 
depend only on the constants in (5.1) (including a and r ) and in the Ahlfors regularity 
condition. 

Proof. Both cases a > n and a < n are very similar (in fact, slightly simpler) to the 
remaining case a = n. Hence we give a detailed argument only when a — n. 

With no loss of generality we may assume that ro < diamX. We choose k > 2 so 
that m(B(xo,kr) \ B(xo,r)) > r n for all xq G X and r < ro/k. Let r < ro/(l + k) 1+1 ^ a , 
xo G X, D = B(xo,r), and let T t D be the transition kernel of the killed process Xf. 
Recall that Go(x,y) = J °° T 1 D (x,y)dt. Let x,y G D and let ti = (d(x, y)) a , t 2 = {2r) a . 
Since d(x,y) < 2r < r , we have 



(d(x,y))-+« J 2 



and 

"* 2 ^ 2r 



/ 2 Tf(x, y) < c (6 .i) / 2 *~ nA ^t = «c (5 .i) log 



d(ac,y) ' 

Note that for a > n or a < n, we simply have a different expression for the above integral. 
When t G [t 2 ,2t 2 ], we have t < 2t 2 < 2 l+a r a < r£, and hence T t D (x,y) < c (5 .i)t 2 ~ n/Q = 
C(5.i) (2r)~ n . Furthermore, since z) < (1 + fc)r < r for z G B(xq, kr), 



T£l(x) < T t2 l D (x) < 1 - / T t2 (x,z)m(dz) 

JX\D 



*\ 1 — / TYl(clz} 

C(5.1) J B(x ,kr)\B(x ,r) (d(x,z)) n + a 

< j _ 2 Q m(ff(x ,£;r)\ff(x ,r)) < j _ 2" 



c (5 .i) (A; + 1)™+ Q r™ ~ 0(5.1) (k + l) n+a 
For s = jt 2 +t,te [t 2 , 2t 2 ], j > 0, we have T S D = (T^Tf. It follows that 

2" V C( 5 .l) 



Ts ^^-y 1 C( 5 .i) (A; + i)n+ a J (2r) 
and therefore, by summing up a geometric series, 



T t D (x, y)dt < 2- a (c (5 . 1) ) 2 (k + l) n+a t 2 (2r)~ n = 2~ n {k + l)" +Q (c (5 .i)) 2 r a - n . 

We conclude that G D (x,y) < (c (5 . 1) /2)+nc {5 . 1) \og(2r/d(x,y))+2- n (k+l) 2n (c {5 , 1) ) 2 . This 
gives Assumption D and property (e). Property (d) follows by simple integration. □ 
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If X t has a-stable-like scaling, then, by a simple substitution, in Theorem 3.5 we have 

c {3 . 9) (x ,r,q,R) <c(r/R,q/R)R a - d , 
C(3.n)(x , q, R) < c(q/R)R a , 
C(i.i){xo,r,R) < c(r/R). 

Hence the boundary Harnack inequality is uniform in all scales R G (0,Rq), or scale- 
invariant, as claimed in Remark 3.6. We state this result as a separate theorem for 
future reference. 

Theorem 5.4. // the assumptions of Theorem 3.5 are satisfied, and the process X t 
has a-stable-like scaling, then the boundary Harnack inequality (BHI) is scale-invariant: 
c (i.i)(x ,r, R) depends only on r/R. 

In typical applications, one verifies (typically quite straightforward) conditions (a) 
and (g), formula (5.1) (which has been proved for a fairly general class of processes), and 
condition (f). When dealing with processes given the Levy kernel v(x,y), condition (f) 
turns out to be the most restrictive one. 

Example 5.5 (Levy processes). Theorem 3.5 applies to a large class of Levy processes. 
In this case, the notion of processes in duality and properties of the Feller generator 
simplify significantly, see [70]. 

Let X t be a Levy process in X = R fc (with the Euclidean distance d and Lebesgue 
measure m). Then X t is always Feller, and it is strong Feller if and only if the distribution 
of X t is absolutely continuous (with respect to the Lebesgue measure). If this is the 
case, Assumption A is satisfied: the dual of X t exists, and it is the reflected process, 
X t — Xq = —(Xt — Xq). Assumption B is always satisfied with T> = C^°(R fc ). The Levy 
kernel of X t is translation- invariant, v[x, E) = v(E — x), where v[dz) is the Levy measure 
of X t . Therefore, Assumption C can be restated as follows: the Levy measure of X t is 
absolutely continuous, and its density function v(z) satisfies 

c (2.7) u ( z o) < u ( z ) < c (2.7) K^q), \zo\ > R,\z- z \ < r, (5.2) 

whenever < r < R, with constant C(2.7)(0, r, R). If, e.g., v[z) is isotropic and radially 
decreasing, then (5.2) is equivalent to v(z2) > cu(zi) being valid whenever \zi\ > 1 
and 1 22 1 — \zi | + 1. Finally, Assumption D in many cases follows from estimates of the 
potential kernel U(x,y) = U(y — x), or, in the recurrent case, the a-potential kernel 
U a {x,y) = U a {y-x). 

We conclude that boundary Harnack inequality holds for a Levy process X t , provided 
that its Levy measure satisfies (5.2), one-dimensional distributions of X t are absolutely 
continuous, and the Green functions of balls satisfy Assumption D. This class includes: 

• subordinate Brownian motions which are not compound Poisson processes and 
have non-zero Levy measure density function satisfying v(z2) > cv(z{) if \z\\ > 1 
and \z 2 \ = \zi\ + 1 (for properties of these processes, see, e.g., [23, 62]); 

• (possibly asymmetric) Levy processes with non-degenerate Brownian part and 
Levy measure satisfying (5.2); 

• (possibly asymmetric) strictly stable Levy processes, whose Levy measure is of 
the form \z\~ d ~ a f(z/\z\)dz for a function / bounded below and above by positive 
constants. 

Scale-invariance is a different question, which depends on more accurate estimates. We 
list some examples. 
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• For the class of strictly stable Levy processes described above, scale-invariance 
follows from the estimates of the transition density given in [79, Theorem 1.1] and 
Proposition 5.3; see also [28] and the references therein for related estimates in 
the symmetric (but anisotropic) case. 

• Some Levy processes for which Theorem 3.5 gives scale-invariant BHI are included 
in Example 5.6 (stable-like Levy processes) and Example 5.8 (mixtures of isotropic 
stable processes, relativistic stable processes, etc.). 

• A non-scale-invariant case (mixture of an isotropic stable process and the Brow- 
nian motion) is discussed in Example 5.13. 

• Using the results of the first part of [63], one can obtain scale- invariant BHI for 
the class of Levy processes considered therein (isotropic Levy processes with Levy 
measure comparable to the Levy measure of a sufficiently regular subordinate 
Brownian motion), thus significantly simplifying the other part of that article. 

• Also, the estimates given in [58], combined with Theorem 3.5, imply scale-invariant 
BHI at least for sufficiently regular subordinate Brownian motions (other than 
compound Poisson processes) with Levy-Khintchine exponent slowly varying at oo. 

Example 5.6 (Stable-like processes). Let X be a closed set in R fc , and let m be a measure 
on X such that X, with the Euclidean distance, is an Ahlfors regular n-space for some 
n > 0. For example, X can be entire R fe or the closure of an open set in R fc (with the 
Lebesgue measure m; then n = k). On the other hand, X can be a fractal set, such as 
Sierpihski gaskets (n = \og(k + l)/log2) or Sierpihski carpets (n = log(3 fc — l)/log3) 
in R 2 , equipped with an appropriate Hausdorff measure. By this assumption, scaling 
property (a) is satisfied. 

Let a G (0,2), and suppose that u(x,y) = u(y,x) and 

ci\x-y\~ n ~ a < v(x,y) < c 2 \x-y\~ n ~ a , x,yeX. (5.3) 

This immediately gives Assumption C with scaling property (g). 

By [33, Theorem 1], there is a Feller, strong Feller, symmetric pure-jump Hunt process 
X t with Levy kernel u, and the continuous transition probability T t (x,y) of X t satis- 
fies (5.1) for some Tq. Assumption D and scaling property (e) follow by Proposition 5.3. 
Since X t is symmetric (self-dual) and has continuous transition densities, Assumption A 
is also satisfied. 

Finally, we assume that Assumption B holds with scaling property (f) (see below). 
Under the above assumptions, scale-invariant boundary Harnack inequality holds with 
some localization radius. When X is unbounded, a ^ n and scaling property (a) holds 
for all r > 0, then (5.1) holds for all t > and all x, y G X, see [33], and therefore we can 
take R = oo. 

We list some cases when Assumption B with scaling property (f ) is known to hold true. 

• When X = R fc and v{x, y) is a function of x — y, then X t is a symmetric Levy 
process and we can simply take T> = C^°(R k ). 

• More generally, let X = R fc , and assume that v(x,y) = n(x,y)\y — x\~ k ~ a for a 
C£°(R fe x R fc ) function k. We claim that Assumption B with scaling property (f) 
holds for V = L7 6 °°(R fc ). Indeed, for / G C c °°(R fc ) let 

A f(x) = [ (f{x +z)- fix) - * ■ V/(*)) dz 

Jn*\ 1 + 1*1 J \A (54) 
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Then A is a symmetric pseudo-differential operator with appropriately smooth 
symbol, and by [48, Theorem 5.7], the closure of A is the Feller generator of a 
symmetric Hunt process X t (we omit the details). Since the pure-jump Feller 
processes X t and X t have equal Levy kernels, they are in fact equal processes, 
and hence the closure of A is the Feller generator of X t . Assumption B with 
V = C^°(R k ) follows, and scaling property (f) is a simple consequence of (5.4). 
See also [47, 77]. 

• When a G (0,1), X is the closure of an open Lipschitz set, and v(x,y) = 
c\x — y\~ k ~ a , then the desired condition is satisfied by V = C£°(R fc ) (see [44, 
Theorem 6.1(i)]). 

• When a G [1,2), X is the closure of an open set with C 1,/3 -smooth boundary for 
some > a— 1, and u(x,y) = c\x — y\~ k ~ a , then one can take T> to be the class of 
C£°(R fc ) functions with normal derivative vanishing everywhere on the boundary 
of X (see [44, Theorem 6.1(h)]). 

• For the case when X t is a subordinate diffusion on X, see Example 5.7. In this 
case, when X is a fractal set, one can even deal with a greater than 2. 

Note that an analytical proof of Theorem 3.4 discussed in Remark 4.12 may lead to a 
generalization of this example, which would not require Assumption B. 



Example 5.7 (Stable-like subordinate diffusions in metric measure spaces). Suppose 
that (X, d, m) is an Ahlfors regular n-space for some n > 0. Assume that the metric d is 
uniformly equivalent to the shortest-path metric in X. Suppose that there is a diffusion 
process Z t with a symmetric, continuous transition density T t z (x,y) satisfying the sub- 
Gaussian bounds 



fn/d n 



exp -c 2 



d(x,y) c 



< 



_C3_ 
fn/du 



exp — c 4 



<T t z (x,y) 

'dU.y)' 1 ^ ; 



(5.5) 



for all x,y G X and t G (0, to) (^o = 00 when X is unbounded). Here d w > 2 is the 
walk dimension of the space X. The existence of such a diffusion process Z t is well- 
known when X is a Riemannian manifold (d w = 2; see [42]), the fc-dimensional Sierpihski 
gasket (d w = \og(k + 3) /log 2 > 2; see [11]), more general nested fractals [40, 66], or the 
Sierpihski carpets [7, 8]; see [57] for more information. 

Let a G (0, d w ) and let X t be the stable-like process obtained by subordination of Z t 
with the a/d w -stab\e subordinator rjt, X t = Z(r] t ). These processes were first studied 
in [27, 67, 74]. By the subordination formula, the transition density estimate (5.1) holds 
for some r (if X is unbounded, then it was proved in [27] that we can take r = oo). 

Since X t is symmetric and has continuous transition densities, Assumption A is clearly 
satisfied. The Levy kernel of X t satisfies c~ 1 d(x, y)~ n ~ a < v(x, y) < cd(x, y)~ n ~ a , see [27], 
and Assumption C with scaling property (g) follows. Assumption D and scaling prop- 
erty (e) follow from the transition density estimate (5.1) by Proposition 5.3; see also [27, 
Lemmas 5.3 and 5.6]. Finally, Assumption B with scaling property (f) follows by the con- 
struction of [69, Section 2]. Roughly speaking, the method of [69] yields smooth bump 
functions in the domain of the generator of the diffusion Z t with appropriate scaling. 
By the subordination formula, these bump functions are in the domain of A, and the 
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constants scale appropriately. Since there are some nontrivial issues related to the con- 
struction, we repeat the construction with all details in Appendix A. By Corollary A. 4 
there, Assumption B is satisfied with scaling property (f). 

We conclude that scale-invariant boundary Harnack inequality for X t holds in the full 
range of a G (0, d w ). Noteworthy, we obtain a regularity result also for a > 2, when 
Lipshitz functions no longer belong to the domain of the Dirichlet form of X t . 

This example can be extended in various directions. Instead of taking r]t the a/d w - 
stable subordinator, one can consider a subordinator r) t whose Laplace exponent ip is a 
complete Bernstein function regularly varying of order a/d w (a G (0,d w )) at infinity. 
Such subordinators have no drift, and the Levy measure with completely monotone den- 
sity function, regularly varying of order — 1 — a/d w at 0. Their potential kernel is regularly 
varying of order — 1 + a/d w at 0. We refer the reader to [23, 62, 71] for more information 
about subordination, complete Bernstein functions and regular variation. By the subor- 
dination formula, following the method applied for the Euclidean case X = R fc in [62, 63], 
one can obtain two-sided estimates for the Levy kernel v{x,y) and the potential kernel 
U(x,y) in terms of ip, a t least when X is unbounded and a < d. These estimates are 
sufficient to prove the scale-invariant boundary Harnack inequality. 

Similar methods should be applicable also when X t is recurrent (that is, X is bounded, 
or a > d). In this case, estimates of U(x,y) need to be replaced by estimates of the 
A-potential kernel U\(x,y). Another interesting directions are the case of slowly varying 
if), which corresponds to a = 0, and, on the other hand, the case of pure-jump processes 
with ip regularly varying of order 1 (that is, a = d w ). Finally, one can perturb processes 
considered above, in a similar way as in the next example. 

Example 5.8 (Stability under small perturbations). Let X = R z , d be the Euclidean 
distance, m be the Lebesgue measure, and a G (0,2). Suppose that v{x,y) is a Levy 
kernel of a Hunt process X t considered in Example 5.6, and A is the corresponding Feller 
generator. For example, z/(x,|/) can be any function of y — x satisfying (5.3). In this 
example we consider a perturbation u(x,y) of the kernel v(x,y). 

Although a more general construction is feasible, we are satisfied with the following 
setting. Let z/(x,y) = v[x,y) +n(x,y), where n(x,y) is chosen so that z/(x,y) satisfies 
the scaling property (g), n(x,y) and h(x,y) = n(y,x) are kernels of bounded operators 
on C (R fc ), and 

/ n(x,y)dy = / h(x 1 y)dy ) x G R fe ; 

the last assumption guarantees that m is an excessive (in fact, invariant) measure for the 
process X t defined below. 

The formula J\ff(x) = J Rfc (/(|/) — f(x))n(x, y)dy defines a bounded linear operator on 
Co(R fc ), and A = A+Af (defined on the domain of A) has the positive maximum property. 
By a standard perturbation argument, A is the Feller generator of a Hunt process X t , and 
u(x, y) is the Levy kernel of X t . The process X t and its Feller generator A are constructed 
in a similar manner, using the Feller generator of the dual of X t and the kernel n(x, y). It 
is easy to see that J Rfc Af(x)g(x)dx = J Rfc f(x)Ag(x)dx for f,g G C^°(R fc ), from which 

it follows that X t is indeed the dual of X t . 

The transition density of X t satisfies (5.1) (see Example 5.6). The process X t can 
be constructed probabilistically using X t and Meyer's method of adding and removing 
jumps. Hence, by [9, Lemma 3.6] and [10, Lemma 3.1(c)], the transition density of X t 
exists and also satisfies (5.1) for smaller r (see also [30, Proposition 2.1]). 
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It follows that Assumption A is satisfied. Assumption B holds with V = C^°(R k ), 
and scaling property (f) (with finite Rq) follows from the a-stable-like scaling of A and 
boundedness of Af. Since we assumed that (g) holds true, Assumption C is satisfied with 
scaling properties (b), (c). Assumption D and scaling properties (d), (e) follow from 
transition density estimate (5.1) by Proposition 5.3. Hence, scale-invariant boundary 
Harnack inequality holds true for X t . 

The above setting includes mixtures of isotropic stable processes (Levy processes gen- 
erated by A = -(-A) o/2 - c(-A) /3/2 with < /3 < a < 2 and c > 0) and relativistic 
stable processes (Levy processes generated by A = m — (—A + m 2 ^ a ) a ^ 2 with m > 0). 
Also, the dependence of constants on the parameters c, (3, m can be easily tracked. Since 
the perturbation n(x, y) can be asymmetric, many non-symmetric processes are included. 
Finally, this example can be adapted to the setting of Ahlfors-regular n-sets in R fc , as in 
Example 5.6. 

Example 5.9 (Processes killed by a Schodinger potential). Suppose that the assumptions 
for the boundary Harnack inequality in Theorem 3.5 are satisfied. Let X' be an open set 
in X. Let M t be a strong right-continuous multiplicative functional quasi-left continuous 
on [0, oo), for which all points of X' are permanent, and such that M t = for t > T%r. 
Finally, let X t M be the subprocess corresponding to M t (in a similar way as in Section 4; 
see [18] for definitions). Then Xf 1 is a Hunt process on X', uniquely determined by the 
relation Pf (X t M G E) = E x (M t ; X t G E) for any E C X' and x G X' . 

Assume that M t is a continuous function of t G [0, t%i). We claim that in this case the 
Levy kernel v M {x,y) of Xf 1 is again given by u(x,y), restricted to X' x X'. Indeed, by 
formula (4.6) of Lemma 4.3, for x G X' and / G T>(A) vanishing in a neighborhood of x, 
we have 

Ef (f(X t M )) - f{x) = E x (f(X t )M t ) - f(x) 

= E x Qf Af(Xt)Mtdtj + E x (J f(X t )dM t } . 

When divided by t, this converges (for a fixed x) to Af(x) as t — > + . Hence, v M f{x) = 
vf{x). By an approximation argument, this holds for any / G C C (X') vanishing in a 
neighborhood of x, proving our claim. (Note that, however, in general, functions in T>(A) 
need not belong to the domain of the generator of X t M , even if X' = X. ) 

We remark that many such functionals M t are related to Schrodinger potentials V: for a 
nonnegative function V, we have M t = exp(— J Q * V(X s )ds) for t < see [18]. A similar 
construction was used in Section 4 for a particular choice of V. In some applications, the 
potential V can take negative values, the case not covered by this example. 

Let D C X be an open set. By the definition of a subharmonic function, a nonnegative 
function / regular subharmonic on D D X' with respect to the process X 4 M , extended by 
f(x) = for x G X \ X', is also regular subharmonic in D with respect to X t . Hence, the 
hypothesis of Theorem 3.4 holds for Xf 1 with the same constant C(3.g). Of course, one 
needs to replace the sets in the statement of Theorem 3.4 by their intersections with X'. 

We claim that also Lemma 3.1 holds for Xf 1 with the same constant. Indeed, with the 
definitions of the proof of Lemma 3.1 and D' — D n X', for x G B(xq, r) n X' we have 

P™(XM Di eX'\B(x Q ,R)) = E x (M TD -X TD eX\B{x ,R)) 

<E x {f{X TD )M TD )-f{x). 
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By formula (4.6) of Lemma 4.3, 

E x (f(X TD )M Tn ) - f(x) = E X ( r Af(X t )M t dt] +E X ( [ f(X t )dM t 

The second summand on the right hand side is nonpositive. It follows that, 

Pf (X% G X' \ B(x , R)) < E x ( [ ° M t dt] sup Af(y) 

\Jo J yex 

= Ef \t di ) sup A f(y), 

yeSL 

as desired. 

In Lemma 3.2, only the estimates of the Levy measure and mean exit time are used. 
Therefore, also Lemma 3.2 holds for the process X^ with unaltered constants. In a 
similar way, the proof of Theorem 3.5 works for the process Xf' 1 without modifications. 
We conclude that the boundary Harnack inequality holds for X t with the same constants. 
For convenience, we state this result as a separate theorem. 

Theorem 5.10. Suppose that Assumptions A, B, C and D hold true. Let X' be an 
open subset of X, and let X t M be a subprocess of X t , with state space X' , corresponding 
to a strong right- continuous multiplicative functional for X t , continuous before X t hits 
X\ X' , vanishing after that time, and quasi-left continuous on [0, oo). Then the boundary 
Harnack inequality holds true for the process Xf 1 with the same constant cn.i) given 
by (3.10). More precisely, if xq G X, < r < R < R Q , D C B(xo,R) is open, f,g are 
nonnegative regular harmonic functions in D C\X' ( with respect to the process Xf 1 ), and 
f, g vanish in (B(xo, R)\D) fl X' , then we have 

f(x)g(y) < C(i.i) g(x)f(y) , x, y G B(x , r)f] Df] X', 

where C(i.i) = C(i.i)(xo, r, R) does not depend on M t . 

We remark that the continuity assumption for M t is essential. If, for example, M t is 
equal to 1 until the first jump larger than 1, and then 0, the boundary Harnack inequality 
typically does not hold, by an argument similar to one in Example 5.14 below. 

Example 5.11 (Actively reflected and censored stable processes). Let X' C R fc be open 
and let X be the closure of X' in R fe . Suppose that X satisfies property (a). Let u(x, y) = 
c\x — y\~ n ~ a . As in Example 5.6, under suitable assumptions on X, there is a stable-like 
process X t with the Levy kernel u(x,y), and scale-invariant BHI holds for X t . In [21], 
the process X t is called actively reflected a-stable process in X, and the process X' t) 
obtained from X t by killing it upon hitting X \ X', is named censored a-stable process in 
X' (see [21, Remark 2.1]). Clearly, the boundary Harnack inequality for X' t is the special 
case of the boundary Harnack inequality for X t , corresponding to open sets D contained 
in X'. (Note that this is in fact a special case of Theorem 5.10, with M t = 1 for t < tx>.) 
Hence, we have scale-invariant BHI for the actively reflected a-stable process X t and the 
censored a-stable process X' t , whenever X' is a Lipschitz set in the case a G (0, 1), and 
X' is an open set with C^-smooth boundary for some (3 > a — 1 in the case a G [1, 2). 
The above extends the results of [21, 43]. 

Example 5.12 (Gradient-type perturbations of stable processes). Let a G (1,2). If 
b : R fc — > R fc is bounded and differentiable, partial derivatives of b are bounded, and 
div6 = 0, then the process X t generated by —(—A)"/ 2 + b ■ V, and the process X t 
generated by —(—A)"/ 2 — b ■ V are mutually dual. Such processes are considered in the 
recent paper [51]. The Levy kernels of X t and X t are the same as that of the isotropic 
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a-stable Levy process generated by (— A) Q//2 , see [25]. Furthermore, T> = C£°(R fe ) is 
contained in the domains of A and A. Therefore, a scale-invariant (with finite Rq) 
boundary Harnack inequality holds for the process X t . 

We conclude this article with some negative or partially negative examples. 

Example 5.13 (Levy processes with Brownian component). Let X = R fc , d be the 
Euclidean distance, m be the Lebesgue measure, and a G (0,2). Let X t be the sum of 
two independent processes, the Brownian motion and the isotropic a-stable Levy process. 
That is, X t is the Levy process with generator A = c x A — c 2 (— A) Q//2 . 

Clearly, X t is symmetric and has transition densities, so Assumption A is satisfied. Fur- 
thermore, T>(A) contains C^°(R fc ), and hence Assumption B is satisfied with 2-stable-like 
scaling: the property (f) holds with a replaced by 2. On the other hand, Assumption C 
clearly holds with a-stable-like scaling (g). Furthermore, detailed estimates for the tran- 
sition density of X t can be established ([34]), from which Assumption D follows as in 
Proposition 5.3, with 2-stable scaling. 

It follows that boundary Harnack inequality holds despite the diffusion component. 
However, the constant cnufeo, r, R) is not bounded when, for example, R = 2r and 
r — > + . This is a typical behavior for processes comprising both jump and diffusion 
part, and for general open sets one cannot expect a scale-invariant result: the boundary 
Harnack inequality in the form given in (BHI) does not hold for the Brownian motion 
without some regularity assumptions on the boundary of D, cf. [14]. On the other hand, 
the scale-invariant boundary Harnack inequality for X t in more smooth domains was 
established in [32]. 

Example 5.14 (Truncated stable processes). This example shows why Assumption C 
is essential for the boundary Harnack inequality in the form given in (BHI). Consider 
the truncated isotropic a-stable Levy process X t in X = R fe , a G (0,2), n > 1. This 
is a pure-jump Levy process with Levy kernel v(x,y) = c\x — y\~ n ~ a lB( x ,i){y) ■ Clearly, 
Assumptions A, B and D, as well as formula (2.9), hold true with a-stable-like scaling 
and R = 1, but Assumption C is violated. 

We examine two specific harmonic functions. Let v be a vector in R d with \v\ = 2/3, 
let r G (0,1/6) be a small number, and define B\ = B(xx,r) and B2 = B(x2,r), where 
Xx, %2 G R fc are arbitrary points satisfying x\ — £2 = v. Let D = Bx U B 2 , E\ = Bx + v, 
E 2 = B 2 — v, and let fj(x) = P x (X(td) G Ej). Suppose that x G B\. By (2.12), we have 

S-^cIE^ExTb, < fx{x) < Z n+a c\Ex\E, x T D . 

When x G B 2 , then, again by (2.12), 

fx(x) < P x (X{t B2 ) G Bx) ■ sup fx{y) 

yeBj 

< cT +a \B l \E x r B2 ■ 3 n+a c\Ex\ sup E y r D . 

yeB 2 

Similar estimates hold true for f 2 . It follows that 
fx{x 2 )f 2 {xx) ^ 2 

<< a (3" +a ) 6 |i?(0,l)| 2 r 2 "f sup EyT Bm 

\y£B(0,X) 

This ratio can be arbitrarily small when r — > 0, and therefore (BHI) cannot hold for 
truncated stable process uniformly with respect to the domain. We remark that by an 
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appropriate modification of the above example, one can even construct a single domain 
(an infinite union of balls) for which (BHI) is false. Also, modifications of the above 
example for other truncated processes, or for processes with super-exponential decay of 
the density of the Levy measure can be given. 

On the other hand, if the regular harmonic functions / and g (of the truncated te- 
stable process X t ) vanish outside a unit ball, then clearly / and g are harmonic in D 
also with respect to the standard (that is, non-truncated) isotropic a-stable process in 
R fe . Therefore, the boundary Harnack inequality actually holds true for such functions. A 
different version of boundary Harnack inequality was proved for X t under some regularity 
assumptions on the domain of harmonicity in [59, 60]. 

Appendix A. Smooth bump functions on metric measure spaces with 

sub- Gaussian heat kernels 

In this part we repeat the construction of smooth bump functions of [69]. We adopt 
the setting of Example 5.7: Z t is a diffusion process on an Ahlfors- regular n-space X, the 
transition semigroup Tf of Z t satisfies sub-Gaussian bounds (5.5), and X t is defined to 
be the process Z t subordinated by an independent a / d w -stab\e subordinator, a G (0, d w ). 
The generator of Z t serves as the (Neumann) Laplacian A on X, and Tf is the heat 
semigroup. 

Let h = Tfg for some t > and g G L 2 (X). One of the main results of [69], The- 
orem 2.2, states that given any compact K and e > 0, there is a function / such that 
/ G T>(A ) for all I > 0, f(x) = h(x) on K and f(x) = when dist(x,i^) > e. There are 
at least three issues when one tries to apply this result in our setting. 

First, Theorem 2.2 in [69] is given under the assumption that the spectral gap of A is 
positive. However, this assumption is used only in the proof of Lemma 2.6, which contains 
a flaw: positivity of the spectral gap A does not imply the inequality \\Ptf — /||l 2 (x) < 
A^||/||l 2 (x) ( see hne 3 on page 1769 and line 12 on page 1773 in [69]). This issue has been 
resolved by the authors of [69] in an unpublished note, containing a corrected version of 
the proof of Lemma 2.6. The new argument does not involve the condition on the spectral 
gap, which therefore turns out to be superfluous. For future reference, we provide the 
corrected version of the proof of Lemma 2.6 below. 

Second, to get Assumption B, we need to apply the above theorem with h(x) = 1 for 
x G K, where h = Tfg. This condition is satisfied when g(x) = 1 for all x G X. However, 
such a function g is in L 2 (3L) only when m is a finite measure, and the general case is not 
covered by [69]. For that reason, we choose to repeat the construction of [69] in L°°(X) 
(instead of L 2 (X)) setting. 

Finally, for a scale-invariant boundary Harnack inequality, we need an upper bound 
for || A/|| i00 (£) with explicit dependence on scale, that is, explicit in e and the size (e.g. 
the diameter) of K. Such properties of the estimates are irrelevant in [69], but it turns 
out that they can be obtained by carefully following the proof of Theorem 2.2 in [69]. 

For the above reasons, we decide to give a complete proof of an L°°(X) version of 
Theorem 2.2 in [69]. However, it should be emphasized that method was completely 
developed in [69]. Although we only need the result for g(x) = h(x) = 1 for all x G X, 
for future reference we consider the general case. 

Theorem A.l (a variant of [69, Theorem 2.2]). Suppose that K C X is a compact set, 
e, s > and h = T^g for some g G L°°(X). Then there is a function f G L°°(X) such 
that f(x) = h(x) for x G K , f(x) = when dist(x, K) > e, and f G V(A l ) for any I > 0. 
Furthermore, the L°°(X) norm of f is bounded by the L°°(X) norm of g, f is nonnegative 
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if g is nonnegative, and for all I > we have 



l£°°(£) 

where c (A .i) = c (A .i)(Z, e dw /s, Z t ). 



c (A .i)(diamK + e) n / 2 



ld w +n/2 



(a.i; 



Proof. We divide the argument into five steps. All constants in this proof may depend 
not only on the parameters given in parentheses, but also on the space X and the process 
Z t . Since we never refer to the semigroup of the subordinate process X t , in this proof 
for simplicity we write T t = Tf . Furthermore, also in this proof only, we extend A to 
the L°°(X) generator of T t (recall that originally A was defined as the C (X) generator), 
and denote by A L 2, X \ the L 2 {X) generator of T t , that is, the generator of the semigroup 
of operators T t acting on L 2 {X). Clearly, Af = A L 2(%\f m-a.e. whenever / 6 £*(A) n 
V{A L 2 [X) ). 

Step 1. We begin with some general estimates. By the spectral theorem and the 
inequality X l e~ xt < (left) 1 , for any / e L 2 (X) and I > 0, we have T t f e V((A L 2 {X) ) 1 ), and 



L 2 (X). 



%f\ 



< (left) 1 



Furthermore, by sub-Gaussian estimates (5.5), \\T t (x 
Hence, 

< Cl r n/( - 2dw) 



L 2 {x) - v-/ "i ii j ul 2 (x) ■ 

L 2 {x) = (T 2t (x,x)) l i 2 < Cl r n i^. 



\Ttf\\ 



£°°(£) 



£ 2 (£) • 



We find that 



T s T t f - T t f 



Atf(x)Ttf 



< 



(t/2) n /( M ») 



T s T t / 2 f - T t / 2 f 



^L 2 (X)Tt/2f 



->■ 



as s — > + . It follows that T t f e T>(A), with AT t f = A L 2^T t f. By a similar argument, 
T t f e V(A l ) for any I > 0, and 

cs(l) 



\A l T t f\ 



L°°(X) 



\T t / 2 A T t / 2 f\ 



< 



(■■> 



(X) - t l+n/(2d w ) WJWL^X)- 



\L°°(X) - t n/(2d w ) 

Sub-Gaussian estimate (5.5) and Ahlfors regularity of X also give the following estimate: 
for any set E C X, any e > and any / e L°°(j£) or / G L 1 (X) vanishing in the 
e-neighborhood of i?, we have 



\TJ\\ Loo(E) < D(e,t) \\f\\ Lo 



>(X) 



and 



T t f\\ L1(E) <D(e,t) 



L l (X) J 



where 



Z?(e, £) = sup 



T t (x,y)m(dy) < c 4 exp(-c 5 (^/t) 1/(dtu - 1) ). 

ze* ^3£\B(x,e) 

In particular, given any s > and e > it is possible to choose a strictly increasing 
sequence Sj > convergent to s, with s = 0, such that if tj = Sj — Sj_i (j > 1), then 



lim D(2- j e,s-Sj) = 



and 



^ D(2-%U) ^ c 6 (l,e d -/s) ^ 

2^ r, < n < OO 



i=l 



for any e > 0, I > 0. For example, one can take Sj — (1 — 4 d ™ J )s. Note, however, that 
the above series would diverge if tj decreased either too slowly or too rapidly. 
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Step 2. Let g G L°°(X), s,s > 0, and h(x) = T s g(x), as in the statement of the 
theorem. Following [69] , for j > we define 

Kj = {x G X : dist(z, K) < 2~ j e}, Lj = {x G X : dist(x, K) > (1 - 2^>}, 

and Aj = X \ (Kj U Lj). Furthermore, let Sj and tj be chosen as in Step 1. For j > 1 we 
define 

Uq(x) = 0, Uj(x) = l K .(x)T Sj g(x) + l Aj (x)T t .Uj-i{x). 

Below we prove that T s _ s .Uj converges to a function / with the desired properties. 

Step 3. By induction, ||%|| £,«>(£) < ||<7|| £«>(£) for any j > 0. For j > 1 we have 
Uj-i(x) = T S]1 g(x) for x G i£j-i, and dist(i^j,X \ Kj-i) > 2 _ %. Hence, 

|| Wj - T^uj^W^^ = ||T t .(T s ._ l5 - %-i)|| ioo(K .) 

< D{2^e,t j ) WT^g-u^W^ < 2D{2^e,t j ) \\g\\ LO o {x) , 

where D(2~^e,tj) is as in Step 1 (cf. [69, Lemma 2.3]). Also, Uj vanishes on Lj, Uj-i 
vanishes on Lj_i, and dist(Lj, X \ Lj-i) > 2^^e, so that 

hi - T *^i-i|L~ ( ^) = || T % w i-iL~ ( ^) ^ D ( 2 ~ 3£ ^) \\ u j-i\\ L ~(x) > 
and (using X \ L,_i C K ) 

< D{2~ 3 e,t j ) \\uj-i\\ Ll{X) < D(2- J e,t j )m(K ) ||uj_i|| LO o (x) • 
Hence, using also ||wj-i||i°o(x) < ||<?||l°°(£), we obtain (cf. [69, Lemma 2.5]) 



\Uj - TtMj^W^^ < yJWuj - TtMj^W^^ \\uj - TtjUj-iW^^ 



< D{2- j £,t j Wm{K ) 



(A.3) 



L°°(X) • 



Step 4- We follow the corrected version of the proof of [69, Lemma 2.6]. Let / > 0. For 
j > 1 we have 

j 



Observe that the results of Step 1 and the equality Ui(x) = T ti Ui-i(x) for x G Ai give 



\\A l T 8 - Si (ui - r ti ^-i)|| Xoo(x) < (s _ s C y+i/(2d m ) IN - T uv*-i\\v(£) 

1=1 ^ ' ' 

c 3 (l) 



^ Yl J+n/(2d w ) (lN " TtiUi-lWviKJ + IN ~ T UUi-l 
i=l 

Hence, by (A. 2) and (A.3), 



J2 \\A l T s - Si (Ui -^«i-l)Lco (3E) < 3C 3 (0\/™(^0) IMIl~(£) j^ ^/^ 
i=l i=l 

. o m / 7^ \ I, H c 6 {l + n/{2d w ),e d ™/s) 

<3c 3 (l)Vm(K ) \\g\\ Loa(X) -^j~ 2 
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It follows that the sequence A l T s - Sj Uj converges in L°°(X) as j — > oo for every I > 0. 
Therefore, if f(x) = lim^oo T s - Sj Uj(x), then for all Z > we have / G V(A l ) and 

oo 



i=l 

c 7 (l,e dw Is) / — — — ,. , 
< £ i dw+ n/2 vW^o) \\g\\ LX{x) 



as desired. 

Step 5. By the definition of Uj, for j > 1 we have 

T s - s .Uj = T s - Sj (l Kj T Sj g + \ iij i) 

= + T s _ Sj .(l Aj .T ti 'u i _i - l x \ Kj T Sj g). 

It follows that 

\\Ts-sjUj - T s g\\ Loo{K) = \\T s - s .(l Aj T t .u j - 1 - l x \ Kj T Sj g)\\ Lac(K) 

< D(2~ 3 e,s- sj) \\lA.Tt.Uj-! - l x \ Kj T Sj g\ 

< 2D(2~ 3 e,s- sj) \\g\\ L oo {x) ■ 

The right hand side converges to as j — > oo. Hence, f(x) = T s g(x) = h(x) for x G K. 
Furthermore, || < ||<?|U°°(;3e), and therefore also ||/||l°°(x) < ||s , |U° c (£)- Finally, if 
g > 0, then Uj > for all j > 1, and so / > 0. □ 

By choosing g(x) = h(x) = 1 and s = e dw , we obtain the following result. 

Corollary A. 2. Suppose that K C X is a compact set and e > 0. Tnen i/iere zs a 
function f G L°°(X) snc/i £/iat /(a;) = 1 /or x E K, f(x) = u>/ien dist(x,K) > e, and 
f G T>(A l ) for any I > 0. Furthermore, < f(x) < 1 for all x G X, and /or all I > we 
have 

iiA'/L. g) < C( "' (d ;,r^ +£) " /2 . (A.4) 

w/iere c (A .4) = C(A.4)(£, ^t)- □ 

In general, the boundary of the set {a; G X : /(x) > 0} might be highly irregular. 
However, when we relax the smoothness hypothesis on /, we can require / to be positive 
on an arbitrary given open set. 

Proposition A. 3. Suppose that K C X is a compact set, e > and L > 0. Then there is 
a function f G L°°(X) such that f(x) = 1 for x G K, f(x) = when dist(x, K) > e, and 
f G T>(A l ) for I = 1,2, ... ,L. Furthermore, < f(x) < 1 for all iGl, the boundary of 
the set {iGl: f(x) > 0} has zero m measure, and for all I = 1,2, ... ,L we have 

llA'fll <W di f^! fA5 ) 

where c (A . 5 ) = C(A.5)(£, Z t ). 

Proof. Let /o be the function constructed in Theorem A.l for h(x) = g(x) = 1, and 
denote by V an arbitrary open set with the following properties: {x G X : f{x) > 
0} C V C {x G X : dist(x,i^) < 2e}, and m(5V) = 0. For example, one can take 
V = {x G X : dist(x, K) < r} for a suitable r G (e, 2e). 

Let 5j, j = 1, 2, . . ., be a family of balls contained in V D {x G X : /o(a;) < 1/2} such 
that twice smaller balls B'- form a countable covering of V R {x G X : /o(a?) < 1/2}, and 
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let fj be the function as in Corollary A. 2, equal to 1 on B'j and vanishing on X \ Bj. 
Finally, choose Ej > so that for I = 0, 1, . . . , L, 



22 £i 11^ ^IL°°(3t) < 2 H A M\l°°(x) ■ 
i=i 

Then f = fo + Y^Li e ifi nas a U the desired properties, with e replaced by 2e. □ 
Corollary A. 4. Assumption B holds with a-stable scaling. 

Proof. Given any compact subset K of an open set D C X, choose e > such that 
dist(X \D,K) > e. Since T>(A) C V(A), the function / given in Proposition A. 3 (for 
L = 1) satisfies all conditions of Assumption B. Furthermore, if v ri {s)ds is the Levy 
measure of the subordinator rj t , then 



Uf\\ 



poo poo 

/ (T s z f - f)u v (s)ds < / \\T S Z f - f\\ v v (s)ds 

Jo L°°(X) JO 

< J min ||A/|| ioo(x) ,2||/|| £ao(3E) ) v v {s)ds. 



Note that \\f\\ L °°{x) = 1- Furthermore, min(As,2) < ci(l-e- As ) (with c x = 2e 2 /(e 2 -l)), 



and 



Therefore, 



POO 

/ (1 - e- Xs )v v (s)ds = \ a/d " 
Jo 



a/d u 



\\Af\\ L l|A/|| 

Let < r < i?, and take A" = -B(a;o, r), -D = fi(x , R), e = R — r. We see that 

ii>i*ii ^ /c(A.5)(i, Z t )(2i?) n/2 \ V N D -a 
IIA/IIl-^ < c i ( ^5 — „^„+n/2 = ^ • 



This gives half of the a-stable scaling property (f), and the other half is proved in a 
similar manner. □ 
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